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IMPROVED ENZYMES FOR THE DETECTION OF 
SPECIFIC NUCLEIC ACID SEQUENCES 

This invention was made with United States Government support under Cooperative 
Agreement Number 70NANB7H3015, awarded by the National Institute of Standards and 
Technology (NIST). The United States Government has certain rights in the invention. 

FIELD OF THE INVENTION 

The present invention relates to novel enzymes designed for direct detection, 
characterization and quantitation of nucleic acids, particularly RNA. The present invention 
provides enzymes that recognize specific nucleic acid cleavage structures formed on a target 
RNA sequence and that cleave the nucleic acid cleavage structure in a site-specific manner to 
produce non-target cleavage products. The present invention provides enzymes having an 
improved ability to specifically cleave a DNA member of a complex comprising DNA and 
RNA nucleic acid strands. 

BACKGROUND OF THE INVENTION 

In molecular medicine, a simple and cost-effective method for direct and quantitative 
RNA detection would greatly facilitate the analysis of RNA viruses and the measurement of 
specific gene expression. Both of these issues are currently pressing problems in the field. 
Despite this need, few techniques have emerged that are truly direct. PCR-based detection 
assays require conversion of RNA to DNA by reverse transcriptase before amplification, 
introducing a variable that can compromise accurate quantification. Furthermore, PCR and 
other methods based on exponential amplification (e.g., NASBA) require painstaking 
containment measures to avoid cross-contamination, and have difficulty distinguishing small 
differences (e.g., 2 to 3-fold) in quantity. Other tests that directly examine RNA suffer from 
a variety of drawbacks, including time consuming autoradiography steps (e.g., RNase 
protection assays), or overnight reaction times (e.g., branched DNA assays). With over 1.5 
million viral load measurements being performed in the U.S. every year, there is clearly an 
enormous potential for an inexpensive, rapid, high-throughput system for the quantitative 
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measurement of RNA. 

Techniques for direct, quantitative detection of mRNA are vital for monitoring 
expression of a number of different genes. In particular, levels of cytokine expression (e.g., 
interleukins and lymphokines) are being exploited as clinical measures of immune response in 
5 the progression of a wide variety of diseases (Van Deuren et al, J. Int. Fed. Clin. Chem., 

5:216 [1993], Van Deuren et al, J. Inf. Dis., 169:157 [1994], Perenboom et al, Eur. J. Clin. 
Invest., 26:159 [1996], Guidotti et al, Immunity 4:25 [1996]) as well as in monitoring 
transplant recipients (Grant et al, Transplantation 62:910 [1996]). Additionally, the 
monitoring of viral load and identification of viral genotype have great clinical significance 
10 for individuals suffering viral infections by such pathogens as HIV or Hepatitis C virus 
(HCV). There is a high correlation between viral load {i.e., the absolute number of viral 
particles in the bloodstream) and time to progression to AIDS (Mellors et al, Science 
3 272:1167 [1996], Saag et al, Nature Medicine 2:625 [1996]). For that reason, viral load, as 
Z;\ measured by quantitative nucleic acid based testing, is becoming a standard monitoring 
%5 procedure for evaluating the efficacy of treatment and the clinical status of HIV positive 
q patients. It is thought to be essential to reduce viral load as early in the course of infection as 
j f possible and to evaluate viral levels on a regular basis. In the case of HCV, viral genotype 
O has great clinical significance, with correlations to severity of liver disease and responsiveness 
fy to interferon therapy. Furthermore, because HCV cannot be grown in culture, it is only by 
: jO establishing correlations between characteristics like viral genotype and clinical outcome that 
O new antiviral treatments can be evaluated. 

While the above mentioned methods have been serviceable for low throughput, 
research applications, or for limited clinical application, it is clear that large scale quantitative 
analysis of RNA readily adaptable to any genetic system will require a more innovative 
25 approach. An ideal direct detection method would combine the advantages of the direct 

detection assays (e.g., easy quantification and minimal risk of carry-over contamination) with 
the specificity provided by a dual oligonucleotide hybridization assay. 

Many of the methods described above rely on hybridization alone to distinguish a 
target molecule from other nucleic acids. Although some of these methods can be highly 
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sensitive, they often cannot quantitate and distinguish closely related mRNAs accurately, 
especially such RNAs expressed at different levels in the same sample. While the 
above-mentioned methods are serviceable for some purposes, a need exists for a technology 
that is particularly adept at distinguishing particular RNAs from closely related molecules. 

5 

SUMMARY OF THE INVENTION 

The present invention relates to novel enzymes designed for direct detection, 
characterization and quantitation of nucleic acids, particularly RNA. The present invention 
provides enzymes that recognize specific nucleic acid cleavage structures formed on a target 
10 RNA sequence and that cleave the nucleic acid cleavage structure in a site-specific manner to 
produce non-target cleavage products. The present invention provides enzymes having an 
improved ability to specifically cleave a DNA member of a complex comprising DNA and 
^ RNA nucleic acid strands. 

^: For example, the present invention provides DNA polymerases that are altered in 

% &5 structure relative to the native DNA polymerases, such that they exhibit altered (e.g., 

improved) performance in detection assays based on the cleavage of a structure comprising 
nucleic acid (e.g., RNA). In particular, the altered polymerases of the present invention 
□ exhibit improved performance in detection assays based on the cleavage of a DNA member of 

fy a cleavage structure (e.g., an invasive cleavage structure) that comprises an RNA target strand. 
j|0 The improved performance in a detection assay may arise from any one of, or a 

O combination of several improved features. For example, in one embodiment, the enzyme of 
the present invention may have an improved rate of cleavage (k^) on a specific targeted 
structure, such that a larger amount of a cleavage product may be produced in a given time 
span. In another embodiment, the enzyme of the present invention may have a reduced 
25 activity or rate in the cleavage of inappropriate or non-specific structures. For example, in 
certain embodiments of the present invention, one aspect of improvement is that the 
differential between the detectable amount of cleavage of a specific structure and the 
detectable amount of cleavage of any alternative structures is increased. As such, it is within 
the scope of the present invention to provide an enzyme having a reduced rate of cleavage of 
30 a specific target structure compared to the rate of the native enzyme, and having a further 
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reduced rate of cleavage of any alternative structures, such that the differential between the 
detectable amount of cleavage of the specific structure and the detectable amount of cleavage 
of any alternative structures is increased. However, the present invention is not limited to 
enzymes that have an improved differential. 
5 In a preferred embodiment, the enzyme of the present invention is a DNA polymerase 

having an altered nuclease activity as described above, and also having altered synthetic 
activity, compared to that of any native DNA polymerase from which the enzyme has been 
derived. It is especially preferred that the DNA polymerase is altered such that it exhibits 
reduced synthetic activity as well as improved nuclease activity on RNA targets, compared to 
10 that of the native DNA polymerase. Enzymes and genes encoding enzymes having reduced 
synthetic activity have been described {See e.g., Kaiser et al 9 J. Biol. Chem., 274:21387 
[1999], Lyamichev et aL, Prot. Natl Acad. Sci., 96:6143 [1999], US. Patents 5,541,311, 
JJ 5,614,402, 5,795,763 and U.S. Patent application Ser. No. 08/758,314, incorporated herein by 
J;-! reference in their entireties). The present invention contemplates combined modifications, 
SI 5 such that the resulting 5' nucleases are without interfering synthetic activity, and have 
S improved performance in RNA detection assays. 

^ The present invention contemplates a DNA sequence encoding a DNA polymerase 

O altered in sequence relative to the native sequence, such that it exhibits altered nuclease 
?n activity from that of the native DNA polymerase. For example, in one embodiment, the DNA 
jf 0 sequence encodes an enzyme having an improved rate of cleavage (k cat ) on a specific targeted 
O structure, such that a larger amount of a cleavage product may be produced in a given time 
span. In another embodiment, the DNA encodes an enzyme having a reduced activity or rate 
in the cleavage of inappropriate or non-specific structures. In certain embodiments, one 
aspect of improvement is that the differential between the detectable amount of cleavage of a 
25 specific structure and the detectable amount of cleavage of any alternative structures is 
increased. It is within the scope of the present invention to provide a DNA encoding an 
enzyme having a reduced rate of cleavage of a specific target structure compared to the rate 
of the native enzyme, and having a further reduced rate of cleavage of any alternative 
structures, such that the differential between the detectable amount of cleavage of the specific 
30 structure and the detectable amount of cleavage of any alternative structures is increased. 



However, the present invention is not limited to polymerases that have an improved 
differential. 

In a preferred embodiment, the DNA sequence encodes a DNA polymerase having the 
altered nuclease activity described above, and also having altered synthetic activity, compared 
5 to that of any native DNA polymerase from which the improved enzyme is derived. It is 

especially preferred that the encoded DNA polymerase is altered such that it exhibits reduced 
synthetic activity as well as improved nuclease activity on RNA targets, compared to that of 
the native DNA polymerase. 

It is not intended that the invention be limited by the nature of the alteration required 
10 to introduce altered nuclease activity. Nor is it intended that the invention be limited by the 
extent of either the alteration, or in the improvement observed. If the polymerase is also 
altered so as to be synthesis modified, it is not intended that the invention be limited by the 
Jp polymerase activity of the modified or unmodified protein, or by the nature of the alteration 
fl to render the polymerase synthesis modified. 

""IS The present invention contemplates structure-specific nucleases from a variety of 

q sources, including, but not limited to, mesophilic, psychrophilic, thermophilic, and 

^ hyperthermophilic organisms. The preferred structure-specific nucleases are thermostable. 
O Thermostable structure-specific nucleases are contemplated as particularly useful in that they 
S] allow the INVADER assay (See e.g., U.S. Pat. Nos. 5,846,717, 5,985,557, 5,994,069, and 

p|0 6,001,567 and PCT Publications WO 97/27214 and WO 98/42873, incorporated herein by 
O reference in their entireties) to be operated near the melting temperature (TjJ of the 

downstream probe oligonucleotide, so that cleaved and uncleaved probes may cycle on and off 
the target during the course of the reaction. In one embodiment, the thermostable 
structure-specific enzymes are thermostable 5' nucleases that are selected from the group 
25 comprising altered polymerases derived from the native polymerases of Thermus species, 
including, but not limited to, Thermus aquaticus, Thermus flavus, Thermus thermophilus, 
Thermus filiformus, and Thermus scotoductus. However, the invention is not limited to the 
use of thermostable 5' nucleases. For example, certain embodiments of the present invention 
utilize short oligonucleotide probes that may cycle on and off of the target at low 
30 temperatures, allowing the use of non-thermostable enzymes. 



In some preferred embodiments, the present invention provides a composition 
comprising an enzyme, wherein the enzyme comprises a heterologous functional domain, 
wherein the heterologous functional domain provides altered (e.g., improved) functionality in 
a nucleic acid cleavage assay. The present invention is not limited by the nature of the 
5 nucleic acid cleavage assay. For example, nucleic acid cleavage assays include any assay in 
which a nucleic acid is cleaved, directly or indirectly, in the presence of the enzyme. In 
certain preferred embodiments, the nucleic acid cleavage assay is an invasive cleavage assay. 
In particularly preferred embodiments, the cleavage assay utilizes a cleavage structure having 
at least one RNA component. In another particularly preferred embodiment, the cleavage 
10 assay utilizes a cleavage structure having at least one RNA component, wherein a DNA 
member of the cleavage structure is cleaved. 

In some preferred embodiments, the enzyme comprises a 5' nuclease or a polymerase, 
yg In certain preferred embodiments, the 5' nuclease comprises a thermostable 5' nuclease. In 
^; other preferred embodiments, the polymerase is altered in sequence relative to a naturally 
'"95 occurring sequence of a polymerase such that it exhibits reduced DNA synthetic activity from 
r| that of the naturally occurring polymerase. In certain preferred embodiments, the polymerase 
j 1 " comprises a thermostable polymerase (e.g., a polymerase from a Thermus species including, 

but not limited to, Thermus aquaticus, Thermus flaws, Thermus thermophilus, Thermus 
m filiformus, and Thermus scotoductus). 

p|0 The present invention is not limited by the nature of the altered functionality provided 

Q by the heterologous functional domain. Illustrative examples of alterations include, but are 
not limited to, enzymes where the heterologous functional domain comprises an amino acid 
sequence (e.g., one or more amino acids) that provides an improved nuclease activity, an 
improved substrate binding activity and/or improved background specificity in a nucleic acid 
25 cleavage assay. 

The present invention is not limited by the nature of the heterologous functional 
domain. For example, in some embodiments, the heterologous functional domain comprises 
two or more amino acids from a polymerase domain of a polymerase (e.g., introduced into the 
enzyme by insertion of a chimeric functional domain or created by mutation). In certain 
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preferred embodiment, at least one of the two or more amino acids is from a palm or thumb 
region of the polymerase domain. The present invention is not limited by the identity of the 
polymerase from which the two or more amino acids are selected. In certain preferred 
embodiments, the polymerase comprises Thermits thermophilus polymerase. In particularly 
preferred embodiments, the two or more amino acids are from amino acids 300-650 of SEQ 
ID NO:267. 

The novel enzymes of the invention may be employed for the detection of target 
DNAs and RNAs including, but not limited to, target DNAs and RNAs comprising wild type 
and mutant alleles of genes, including, but not limited to, genes from humans, other animal, 
or plants that are or may be associated with disease or other conditions. In addition, the 
enzymes of the invention may be used for the detection of and/or identification of strains of 
microorganisms, including bacteria, fungi, protozoa, ciliates and viruses (and in particular for 
the detection and identification of viruses having RNA genomes, such as the Hepatitis C and 
Human Immunodeficiency viruses). For example, the present invention provides methods for 
cleaving a nucleic acid comprising providing: an enzyme of the present invention and a 
substrate nucleic acid; and exposing the substrate nucleic acid to the enzyme (e.g., to produce 
a cleavage product that may be detected). 

In one embodiment, the present invention provides a thermostable 5' nuclease having 
an amino acid sequence selected from the group comprising SEQ ID NOS:75, 77, 79, 81, 83, 

85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 106, 109, 112, 115, 118, 121, 124, 127, 130, 133, 
136, 139, 142, 145, 148, 150, 153, 157, 160, 163, 166, 169, 172, 175, 178, 181, 184, 187, 
190, 200, 202, 204, 206, 212, 214, 216, 218, 221, 226, 228, 230, 232, 234, 236, 239, 241, 
243, 251, 259, 261, 263, and 265. In another embodiment, the 5' nuclease is encoded by a 
DNA sequence selected from the group comprising of SEQ ID NOS:74, 76, 78, 80, 82, 84, 

86, 88, 90, 92, 94, 96, 98, 100, 102, 105, 108, 111, 114, 117, 120, 123, 126, 129, 132, 135, 
138, 141, 144, 147, 149, 152, 156, 159, 162, 165, 168, 171, 174, 177, 180, 183, 186, 189, 
199, 201, 203, 205, 211, 213, 215, 217, 220, 225, 227, 229, 231, 233, 235, 238, 240, 242, 
250, 258, 260, 262, and 264. 

The present invention also provides a recombinant DNA vector comprising DNA 
having a nucleotide sequence encoding a 5' nuclease, the nucleotide sequence selected from 



the group comprising SEQ ID NOS:74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 
102, 105, 108, 111, 114, 117, 120, 123, 126, 129, 132, 135, 138, 141, 144, 147, 149, 152, 
156, 159, 162, 165, 168, 171, 174, 177, 180, 183, 186, 189, 199, 201, 203, 205, 211, 213, 
215, 217, 220, 225, 227, 229, 231, 233, 235, 238, 240, 242, 250, 258, 260, 262, and 264. In 
5 a preferred embodiment, the invention provides a host cell transformed with a recombinant 
DNA vector comprising DNA having a nucleotide sequence encoding a structure-specific 
nuclease, the nucleotide selected from the group comprising sequence SEQ ID NOS:74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 105, 108, 111, 114, 117, 120, 123, 126, 
129, 132, 135, 138, 141, 144, 147, 149, 152, 156, 159, 162, 165, 168, 171, 174, 177, 180, 
10 183, 186, 189, 199, 201, 203, 205, 211, 213, 215, 217, 220, 225, 227, 229, 231, 233, 235, 

238, 240, 242, 250, 258, 260, 262, and 264. The invention is not limited by the nature of the 
host cell employed. The art is well aware of expression vectors suitable for the expression of 
yi nucleotide sequences encoding structure-specific nucleases that can be expressed in a variety 
s f-l of prokaryotic and eukaryotic host cells. In a preferred embodiment, the host cell is an 
^15 Escherichia coli cell. 

□ The present invention provides a method of altering 5 ? nuclease enzymes relative to 

^ native 5' nuclease enzymes, such that they exhibit improved performance in detection assays 
O based on the cleavage of a structure comprising RNA. In particular, the altered 5' nucleases 
f y produced by the method of the present invention exhibit improved performance in detection 
;||0 assays based on the cleavage of a DNA member of a cleavage structure (e.g., an invasive 
O cleavage structure) that comprises an RNA target strand. The improved 5 ? nucleases resulting 
from the methods of the present invention may be improved in any of the ways discussed 
herein. Examples of processes for assessing improvement in any candidate enzyme are 
provided. 

25 For example, the present invention provides methods for producing an altered enzyme 

with improved functionality in a nucleic acid cleavage assay comprising: providing an 
enzyme and a nucleic acid test substrate; introducing a heterologous functional domain into 
the enzyme to produce an altered enzyme; contacting the altered enzyme with the nucleic acid 
test substrate to produce cleavage products; and detecting the cleavage products. In some 

30 embodiments, the introduction of the heterologous functional domain comprises mutating one 
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or more amino acids of the enzyme. In other embodiments, the introduction of the 
heterologous functional domain into the enzyme comprises adding a functional domain from a 
protein (e.g., another enzyme) into the enzyme (e.g., substituting functional domains by 
removing a portion of the enzyme sequence prior to adding the functional domain of the 
5 protein). In preferred embodiments, the nucleic acid test substrate comprises a cleavage 
structure. In particularly preferred embodiment, the cleavage structure comprises an RNA 
target nucleic acid. In yet other preferred embodiments, the cleavage structure comprises an 
invasive cleavage structure. 

The present invention also provides nucleic acid treatment kits. One preferred 
10 embodiment is a kit comprising a composition comprising at least one improved 5' nuclease. 
Another preferred embodiment provides a kit comprising: a) a composition comprising at 
least one improved 5' nuclease; and b) an INVADER oligonucleotide and a signal probe 
^ oligonucleotide. In some embodiments of the kits of the present invention, the improved 5' 

nuclease is derived from a DNA polymerase from a eubacterial species. In further 
%|5 embodiments, the eubacterial species is a thermophile. In still further embodiments, the 
H thermophile is of the genus Thermus. In still further embodiments, the thermophile is selected 
^ from the group consisting of Thermus aquaticus, Thermus flaws, Thermus thermophilus, 
C3 Thermus filiformus, and Thermus scotoductus. In preferred embodiments, the improved 5' 

f?« nuclease is encoded by DNA selected from the group comprising SEQ ID NOS:74, 76, 78, 
jo 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 105, 108, 111, 114, 117, 120, 123, 126, 129, 
£ 132, 135, 138, 141, 144, 147, 149, 152, 156, 159, 162, 165, 168, 171, 174, 177, 180, 183, 

186, 189, 199, 201, 203, 205, 211, 213, 215, 217, 220, 225, 227, 229, 231, 233, 235, 238, 
240, 242, 250, 258, 260, 262, and 264. In yet other preferred embodiments, the kits further 
comprise reagents for detecting a nucleic acid cleavage product. In further preferred 
25 embodiments, the reagents for detecting a cleavage product comprise oligonucleotides for use 
in a subsequent invasive cleavage reaction (See e.g., U.S. Patent No. 5,994,069). In 
particularly preferred embodiments, the reagents for the subsequent invasive cleavage reaction 
comprise a probe labeled with moieties that produce a fluorescence resonance energy transfer 
(FRET) effect. 



DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic representation of sequential invasive cleavage reactions. 
In step A, an upstream INVADER oligonucleotide and a downstream probe combine with a 
target nucleic acid strand to form a cleavage structure. In step B, the portion of the cleaved 
5 signal probe from A combines with a second target nucleic acid strand and a labeled signal 

probe to form a second cleavage structure. In step C, cleavage of the labeled second cleavage 
structure yields a detectable signal. 

Figure 2 shows schematic representations of several examples of invasive cleavage 
structures comprising RNA target strands (SEQ ID NO: 9). Panel A depicts an INVADER 
10 oligonucleotide (SEQ ID NO: 10) and probe (SEQ ID NO:ll). Panel B depicts an INVADER 
oligonucleotide (SEQ ID NO: 12) and probe (SEQ ID NO: 11). Panel C depicts an INVADER 
oligonucleotide (SEQ ID NO: 13) and probe (SEQ ID NO: 13). Panel D depicts an INVADER 
JJ oligonucleotide (SEQ ID NO: 13) and probe (SEQ ID NO: 14). 

Figure 3 shows schematic representations of two examples of structures that are not 
% 15 invasive cleavage structures labelled SEQ ID NOs:269-274. 

S Figure 4 shows a schematic representation of a configuration of invasive cleavage that 

+ is useful for detection of target sequence variations. In A, an invasive cleavage structure 
O having overlap between the two probes is formed, and the arrow indicates that it is cleavable 

Sjjj by the enzymes of the present invention. In B, variation of the target sequence removes a 
4§0 region of complementarity to the downstream probe and eliminates the overlap. The absence 
Q of an arrow in panel B indicates a reduced rate of cleavage of this structure compared to that 

diagrammed in panel A. 

Figure 5 shows a diagram of the X-ray structure of a ternary complex of Klentaql 

with primer/template DNA in the polymerizing mode determined by Li et al (Li et al. 9 
25 Protein Sci., 7:1116 [1998]). Without intending to represent precise borders between features 

of the physical form, the portions referred to in the text as the "fingers", "thumb" and "palm" 

regions are loosely indicated by the circle, rectangle, and oval, respectively. 

Figure 6 shows a schematic diagram of the DNA polymerase gene from Thermus 

aquaticus. Restriction sites used in these studies are indicated above. The approximate 
30 regions encoding various structural or functional domains of the protein are indicated by 
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double-headed arrows, below. 

Figure 7 shows a schematic diagram of the chimeric constructs comprising portions of 
the TaqPol gene and the TthPol gene. Open and shaded boxes denote TaqPol and TthPol 
sequences, respectively. The numbers correspond to the amino acid sequence of TaqPol. The 
5 ? nuclease and polymerase domains of TaqPol and the palm, thumb, and fingers regions of 
the polymerase domain are indicated. The abbreviations for the restrictions sites used for 
recombination are as follows: E, EcoRI; N, NotI; Bs 5 BstBI; D, Ndel; B, BamHI; and S, 
Sail. 

Figure 8 is a comparison of the nucleotide structure of the polymerase genes isolated 
from Thermus aquaticus (SEQ ID NO:l), Thermus flavus (SEQ ID NO:2) and Thermus 
thermophilics (SEQ ID NO:266); the consensus sequence (SEQ ID NO:7) is shown at the top 
of each row. 

Figure 9 is a comparison of the amino acid sequence of the polymerase isolated from 
Thermus aquaticus (SEQ ID NO:4), Thermus flavus (SEQ ID NO:5), and Thermus 
thermophilus (SEQ ID NO:267); the consensus sequence (SEQ ID NO:8) is shown at the top 
of each row. 

Figure 10 shows the sequences and proposed structures of substrates for the invasive 
signal amplification reaction with human IL-6 RNA target strand (SEQ ID NO: 17) and 
upstream probe (SEQ ID NO: 15). The cleavage site of the downstream probe (SEQ ID 
NO: 16) is indicated by an arrow. Sequence of the IL-6 DNA target strand (SEQ ID NO: 18) 
is shown below. 

Figure 1 1 shows the image generated by a fluorescence imager showing the products 
of invasive cleavage assays using the indicated enzymes, and the IL-6 substrate of Figure 10 
having either a DNA target strand (A) or an RNA target strand (B). 

Figure 12 compares the cycling cleavage activities of Taq DN RX HT, Tth DN RX 
HT, and Taq-Tth chimerical enzymes with IL-6 substrate having an RNA target strand. 

Figure 13 shows a comparison of the amino acid sequences of the Bstl-BamHI 
fragments of TaqPol (SEQ ID NO:19) and TthPol (SEQ ID NO:20). Pairs of similar amino 
acids are shaded with light gray. Aligned amino acids that have a charge difference are 
shaded with dark gray. The numbers correspond to the amino acid sequence of TaqPol. 
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Amino acids of TaqPol changed to the corresponding amino acids of TthPol by site-directed 
mutagenesis are indicated by (+). 

Figure 14 compares the cycling cleavage activities of Taq DN RX HT, Taq-Tth 
chimerical enzymes, and chimerical enzymes having the indicated additional amino acid 
modifications, with IL-6 substrate having an RNA target strand. 

Figure 15 compares the cycling cleavage activities of Taq DN RX HT, Tth DN RX 
HT, and Taq DN RX HT having the indicated amino acid modifications, with IL-6 substrate 
having an RNA target strand. 

Figure 16 compares polymerization activities of TaqPol, TthPol, and Taq-Tth 
chimerical enzymes, and TaqPol having the indicated amino acid modifications. 

Figure 17 shows a diagram of the X-ray structure of a ternary complex of Klentaql 
with primer/template DNA in the polymerizing mode determined by Li et al (Li et al 9 
Protein Sci., 7:1116 [1998]). Amino acids G418 and E507 are indicated. 

Figures 18 A-D show schematic diagrams of examples of substrates that may be used 
to measure various cleavage activities of enzymes. The substrates may be labeled, for 
example, with a fluorescent dye and a quenching moiety for FRET detection, as shown, to 
facilitate detection and measurement. The substrates of 18A and 18B are invasive cleavage 
structures having RNA and DNA target strands, respectively. 18C shows an example of an 
X-structure, and 18D shows an example of a hairpin structure, both of which may be used to 
assess the activity of enzymes on alternative structures that may be present in invasive 
cleavage reactions. 

Figure 19 shows schematic diagrams of chimeric constructs comprising portions of the 
TaqPol gene and the TthPol gene. Open and shaded boxes denote TaqPol and TthPol 
sequences, respectively. The chimeras also include the DN, RX, and HT modifications. A 
table compares the cleavage activity of each protein on the indicated cleavage substrates. 

Figure 20A shows a schematic diagram for an RNA containing invasive cleavage 
substrate. The 5' end of the target molecule (SEQ ID NO:27) is modified with biotin and 
blocked with streptavidin as described. The downstream probe (SEQ ID NO:26) with 
cleavage site is also shown. Panels B-D show analysis of the properties of the Taq DN RX 
HT G418K/E507Q mutant in cleavage of the shown substrate under conditions of varying 
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reaction temperature, KC1 concentration, and MgS0 4 concentration. 

Figure 21 shows schematic diagrams for model substrates used to test enzymes for 
invasive cleavage activity. The molecule shown in 21 A provides a DNA target strand (SEQ 
ID NO:28), while the model shown in 21B provides an RNA containing target strand (SEQ 
ID NO:27). Both 21A and B show downstream probe SEQ ID NO:26. 

Figure 22 shows schematic diagrams for model substrates used to test enzymes for 
cleavage activity on alternative, non-invasive structures. 

Figure 23 shows a schematic diagram for a model substrate used to test enzymes for 
invasive cleavage activity. 

Figure 24 shows schematic diagrams for a model substrate used to test enzymes for 
invasive cleavage activity on an RNA or DNA target strands. 

Figure 25 compares the cycling cleavage activities of Tth DN RX HT, Taq 2M, 
TfiPol, Tsc Pol, and Tfi and Tsc-derived mutant enzymes. 

GENERAL DESCRIPTION OF THE INVENTION 

The INVADER technology (See e.g., U.S. Pat. Nos. 5,846,717, 5,985,557, 5,994,069, 
and 6,001,567 and PCT Publications WO 97/27214 and WO 98/42873, incorporated herein by 
reference in their entireties) provides a signal amplification system that can be applied to the 
detection and quantitation of specific nucleic acid sequences, including single point mutations 
and similar variants of mRNA. Further, because this technology does not rely exclusively on 
allele-specific hybridization, it is well suited for quantitating closely related RNAs in the same 
sample. The present invention provides improved enzymes and methods for creating enzymes 
for the INVADER assay-based detection of nucleic acids, particularly RNA nucleic acids. 
The present invention also provides kits for the performance of INVADER assays using the 
improved enzymes of the present invention. 

The INVADER technology was developed for quantitative detection of DNA and 
RNA, without prior amplification of the target nucleic acid (Lyamichev et al, Nat. 
BiotechnoL, 17:292 [1999]). In addition to its use for the quantitative measurement of 
specific nucleic acid sequences, high specificity provides the capability of detecting single 
base changes. The basis of the INVADER technology is the cleavage of DNA and RNA 

13 



molecules at specific locations in response to structure rather than sequence. Cleavage is 
typically catalyzed by a 5* nuclease enzyme. The 5' nuclease enzymes recognize a precise 
structure that is formed when two oligonucleotide probes, an upstream INVADER probe and a 
downstream signal probe, hybridize in tandem to a nucleic acid target to generate the substrate 
complex (Figure 1A). The high specificity of the INVADER technology arises from 
combining sequence-specific probe hybridization with structure-specific enzymatic cleavage. 
The substrate complex contains a feature that is important for precise enzyme recognition: an 
overlap between the hybridized oligonucleotides. To form an invasive structure, the 3 5 end of 
the upstream INVADER oligonucleotide must overlap with the hybridized region of the 
signal probe by at least one base (Lyamichev et aL, Nat. Biotechnol., 17:292 [1999]). This 
overlap may be created by a duplication of sequence between the 3 5 portion of the upstream 
INVADER oligonucleotide and the 5' portion of the target-complementary region of the 
downstream probe oligonucleotide. The region of sequence so duplicated may be as small as 
a single base. Regardless of the length of the duplicated sequence (i.e., the overlap) the 3' 
terminal base of the upstream INVADER oligonucleotide need not be complementary to the 
target strand, and may be any nucleotide. In some embodiments, this terminal nucleotide may 
be replaced by a moiety having chemical features similar to a nucleotide such as a nucleotide 
analog or an organic ring compound (See e.g., U.S. Pat. No. 5,985,557). In an alternative 
embodiment, the overlap need not involve any duplication of sequence between the 
target-complementary regions of the two probes (Lyamichev et aL, Nat. Biotechnol, 17:292 
[1999] and U.S. Patent 5,985,557). In this embodiment, the INVADER and signal probes 
have regions complementary to adjacent regions of the target that are contiguous and that do 
not overlap. When no sequence is shared, the 3' end of the upstream INVADER 
oligonucleotide includes at least one additional nucleotide or nucleotide-like analog that is not 
complementary to the target strand (Lyamichev et aL 9 Nat. Biotechnol, 17:292 [1999]). This 
can be referred to as a physical overlap, in contrast to a sequence overlap. An overlap of 
either type will satisfy the requirement for overlap that is the hallmark of the invasive 
cleavage of the INVADER assay. Several of these embodiments are shown schematically in 
Figure 2. In contrast to the overlap configurations described above, if the probes have regions 
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complementary to adjacent regions of the target that are contiguous and that do not overlap, 
and the 3' end of the upstream oligonucleotide does not have any additional base or moiety, 
the invasive structure is not formed (Figure 3 A). Even the presence of one or more additional 
bases on the 5' end of the downstream oligonucleotide that are not complementary to the 
5 target strand will not create the requisite overlap. This latter structure (Figure 3B), as is 

described in U.S. 5,874,283 is not an "invasive cleavage," although such structures find use 
in certain embodiments of the present invention. 

Some 5' nucleases may not require an upstream oligonucleotide to be active in a 
cleavage reaction. Although cleavage may be slower without the upstream oligonucleotide, it 
10 may still occur (Lyamichev et al 9 Science 260:778 [1993], Kaiser et al 9 J. Biol. Chem., 
274:21387 [1999]). When a DNA strand is the template or target strand to which probe 
oligonucleotides are hybridized, the 5' nucleases derived from DNA polymerases and some 
y§ flap endonucleases (FENs), such as that from Methanococcus jannaschii, can cleave quite well 
without an upstream oligonucleotide providing an overlap (Lyamichev et al 9 Science 260:778 
^5 [1993], Kaiser et aL 9 J. Biol. Chem., 274:21387 [1999], and US Patent No. 5,843,669, herein 
S incorporated by reference in its entirety). Other FENs, such as those from Archeaoglobus 
^ fulgidus (Afu) and Pyrococcus furiosus (Pfu), cleave an overlapped structure on a DNA target 
O at so much greater a rate than they do a non-overlapping structure (Le., either missing the 
fU upstream oligonucleotide or having a non-overlapping upstream oligonucleotide) that they can 

jO be viewed as having an essentially absolute requirement for the overlap (Lyamichev et al 9 
O Nat. Biotechnol, 17:292 [1999], Kaiser et al 9 J. Biol. Chem., 274:21387 [1999]). When an 
RNA target is hybridized to DNA oligonucleotide probes to form a cleavage structure, many 
FENs cleave the downstream DNA probe poorly, regardless of the presence of an overlap. 
On such an RNA-containing structure, the 5' nucleases derived from DNA polymerases have 
25 a strong requirement for the overlap, and are essentially inactive in its absence. 

Performing the INVADER assay under conditions that have a tight requirement for an 
overlap (e.g., using the Afu FEN for DNA target detection or the 5' nuclease of Tth DNA 
polymerase for RNA target detection) provides a superior means of detecting single nucleotide 
or other sequence variations. In one embodiment, the signal probe is selected such that the 
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target base suspected of varying is positioned at the 5' end of the target-complementary region 
of this probe. The upstream INVADER oligonucleotide is positioned to provide a single base 
of overlap. If the target and the signal probe are complementary at the base in question, the 
overlap forms and cleavage can occur. This embodiment is diagrammed in Figure 4A. 
However, if the target does not complement the probe at this position, that base in the probe 
becomes part of a non-complementary 5' arm, no overlap between the probes exists, and 
cleavage is suppressed. This embodiment is diagrammed in Figure 4B. In any of the 
aforementioned embodiments, the downstream probe may optionally include a region that is 
not complementary to the target. In a preferred embodiment, this non target-complementary 
region is on the 5' end of the probe and produces an unpaired 5' flap when the signal probe 
is hybridized to the target. Upon cleavage by a CLEAVASE enzyme, a released 5' flap can 
be incorporated into a subsequent INVADER reaction for further amplification of the signal 
(See e.g., U.S. Patent 5,994,069 and PCT Publication WO 98/42873, incorporated herein by 
reference in their entireties). One way it may be used is as an INVADER oligonucleotide, 
which may combine with a provided secondary target and a secondary probe. Upon 
hybridization of the 5' flap released by the CLEAVASE enzyme in the first invasive cleavage 
reaction, a secondary invasive structure complex is completed, so that it may be recognized by 
the CLEAVASE enzyme and the secondary probe oligonucleotide may be cleaved 
(Kwiatkowski et aL, Molec. Diagn., 4:353 [1999]), Figure 1. 

INVADER assays often use thermostable CLEAVASE enzymes, allowing reactions to 
be operated near the melting temperature (TJ of the downstream probe oligonucleotide, so 
that cleaved and uncleaved probes cycle on and off the target during the course of the 
reaction. In a preferred embodiment, a longer INVADER oligonucleotide may not readily 
cycle. Each time a full-length probe binds to the target in the presence of the INVADER 
oligonucleotide it can be cleaved, resulting in an accumulation of cleavage product that is 
both highly specific for the sequence being detected, and that is generally proportional with 
respect to both time and target concentration. The target is generally the limiting component 
in an invasive cleavage, since the INVADER and signal probe oligonucleotides are generally 
supplied in molar excess. In a second linked invasive cleavage, it is the component created in 
the first cleavage reaction (e.g., a released 5' flap) that is limiting. When two such cleavage 
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reactions are performed sequentially, the signal from the composite reaction accumulates 
linearly with respect to the amount of target nucleic acid while the reaction sequence results 
in a tremendous increase in signal amplification (Kwiatkowski et al 3 Molec. Diagn., 4:353 
[1999]). 

Several of the 5' nuclease domains of eubacterial Pol A DNA polymerases and 
structurally homologous DNA repair proteins, called flap endonucleases (FENs) can function 
to cleave the secondary structure formed between the INVADER and signal probe 
oligonucleotides (Kaiser et al, J. Biol. Chem., 274:21387 [1999], Xu et al, J. Biol. Chem., 
published online as 10.1074/jbc.M909135199 at ww.jbc.org/pips/pips.2,shtml, May 9, 2000). 
Both classes of enzymes contain a putative helix-hairpin-helix (HhH) DNA binding motif 
important for sequence independent, structure-based recognition of DNA (Doherty et al 9 
Nucl. Acid. Res., 24:2488 [1996]). This type of DNA binding motif is suitable for assays 
performed on DNA targets, but can be problematic for assays with RNA targets, resulting in 
lower assay sensitivity. New enzymes having improved recognition of the invasive cleavage 
structure formed on an RNA target strand would vastly improve the performance of the 
INVADER assay in the detection and quantitation of RNA targets. 

A number of enzyme improvements related to 5' nucleases and DNA polymerases 
have been described. For example, DNA polymerases having altered 5 5 nuclease activity, or 
lacking 5' nuclease activity altogether have been described (U.S. Patents 5,466,591 and 
5,795,762, each of which is incorporated herein by reference in its entirety). These patents 
relate to thermostable DNA polymerases that exhibit a different level of 5' to 3' exonuclease 
activity than their respective native polymerases. In some embodiments, particular conserved 
amino acid domains in thermostable DNA polymerases are mutated or deleted to alter the 5' 
to 3' exonuclease activity of the polymerases. 

DNA polymerases altered relative to the native polymerases such that they exhibit 
altered DNA synthetic activity have been described (Kaiser et aL 9 J. Biol. Chem., 274:21387 
[1999], Lyamichev et aL 9 Proc. Natl. Acad. Sci., 96:6143 [1999], US. Patents 5,541,311, 
5,614,402, 5,795,763 and U.S. Patent application Ser. No. 08/758,314, incorporated herein by 
reference in their entireties). In preferred embodiments, these DNA polymerases are altered 
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such that they exhibit reduced synthetic activity compared to that of the native DNA 
polymerase. In this respect, enzymes have been created that are predominantly 5' nucleases 
and are capable of cleaving nucleic acids in a structure-specific manner in the absence of 
interfering synthetic activity. The alterations made in these polymerases were not selected 
with respect to their effect of the cleavage of structures comprising RNA. 

DNA polymerases having the ability to use RNA as a template strand, known as 
reverse transcriptases, are usually associated with an RNase activity that specifically cleaves 
RNA basepaired in a heteroduplex with a DNA strand. Such RNase activity is generally 
termed RNase H. Altered reverse transcriptases that have this RNase H activity removed have 
been described (See e.g., U.S. Patent 5,244,797, incorporated herein by reference in its 
entirety). This patent relates to a gene that encodes reverse transcriptase having DNA 
polymerase activity and little or no RNase H activity. The invention also relates to a method 
of producing cDNA from mRNA using the reverse transcriptase. This patent does not 
describe enzymes having improved ability to cleave a DNA member of a structure comprising 
DNA and RNA strands, nor does it relate to enzymes having improved performance in 
detection assays based on the cleavage of a DNA member of a structure that comprises an 
RNA target strand. 

Thermostable RNase H enzymes have been described (e.g., U.S. Patent Nos. 
5,268,289, 5,459,055 and 5,500,370, incorporated herein by reference in their entireties). 
These thermostable enzymes cleave the RNA member of a heteroduplex comprising DNA and 
RNA strands. These patents do not describe enzymes having improved ability to cleave a 
DNA member of a structure comprising DNA and RNA strands, nor do they relate to 
enzymes having improved performance in detection assays based on the cleavage of a DNA 
member of an invasive structure that comprises an RNA target strand. 

There remains a need for enzymes having an improved ability to cleave DNA 
members of structures comprising RNA and DNA strands. In particular, there remains a need 
for thermostable enzymes having improved performance in detection assays based on the 
cleavage of DNA members of invasive complexes comprising an RNA target strand. 
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DEFINITIONS 

To facilitate an understanding of the present invention, a number of terms and phrases 
are defined below: 

As used herein, the term "functional domain" refers to a region, or a part of a region, 
of a protein (e.g., an enzyme) that provides one or more functional characteristic of the 
protein. For example, a functional domain of an enzyme may provide, directly or indirectly, 
one or more activities of the enzyme including, but not limited to, substrate binding capability 
and catalytic activity. A functional domain may be characterized through mutation of one or 
more amino acids within the functional domain, wherein mutation of the amino acid(s) alters 
the associated functionality (as measured empirically in an assay) thereby indicating the 
presence of a functional domain. 

As used herein, the term "heterologous functional domain" refers to a protein 
functional domain that is not in its natural environment. For example, a heterologous 
functional domain includes a functional domain from one enzyme introduced into another 
enzyme. A heterologous functional domain also includes a functional domain native to an 
protein that has been altered in some way (e.g., mutated, added in multiple copies, etc,)« A 
heterologous functional domain may comprise a plurality of contiguous amino acids or may 
include two or more distal amino acids are amino acids fragments (e.g., two or more amino 
acids or fragments with intervening, non-heterologous, sequence). Heterologous functional 
domains are distinguished from endogenous functional domains in that the heterologous amino 
acid(s) are joined to amino acid sequences that are not found naturally associated with the 
amino acid sequence in nature or are associated with a portion of a protein not found in 
nature. 

As used herein, the term "altered functionality in a nucleic acid cleavage assay" refers 
to a characteristic of an enzyme that has been altered in some manner to differ from its 
natural state (e.g., to differ from how it is found in nature). Alterations include, but are not 
limited to, addition of a heterologous functional domain (e.g., through mutation or through 
creation of chimeric proteins). In some embodiments, the altered characteristic of the enzyme 
may be one that improves the performance of an enzyme in a nucleic acid cleavage assay. 
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Types of improvement include, but are not limited to, improved nuclease activity (e.g., 
improved rate of reaction), improved substrate binding (e.g., increased or decreased binding of 
certain nucleic acid species [e.g., RNA or DNA] that produces a desired outcome [e.g., 
greater specificity, improved substrate turnover, etc.]), and improved background specificity 
(e.g., less undesired product is produced). The present invention is not limited by the nucleic 
cleavage assay used to test improved functionality. However, in some preferred embodiments 
of the present invention, an invasive cleavage assay is used as the nucleic acid cleavage assay. 
In certain particularly preferred embodiments, an invasive cleavage assay utilizing an RNA 
target is used as the nucleic acid cleavage assay. 

The terms "signal oligonucleotide" and "signal probe" are used interchangeably, and 
refer to a member of a cleavage structure that is cleaved by a cleavage agent. In an "invasive 
cleavage structure," the signal probe may be referred to as the "downstream probe" in a 
complex in that, with the exception of its overlapping portion, it is largely 3' of an 
INVADER oligonucleotide within the complex. On the target strand, the hybridization site of 
a signal probe is said to be upstream of the hybridization site of an INVADER 
oligonucleotide. 

The terms "INVADER" oligonucleotide or probe and "invasive" oligonucleotide or 
probe may be used interchangeably, and refer to a member of an invasive cleavage structure 
that hybridizes to a target nucleic acid strand and that overlaps the 5' end of a signal probe. 
In an invasive cleavage structure, the INVADER oligonucleotide may be referred to as the 
upstream oligonucleotide or probe in the complex in that, with the exception of its 
overlapping portion, it is largely 5' of a signal probe within the complex. On the target 
strand, the hybridization site of an INVADER oligonucleotide is downstream of the 
hybridization site of a signal probe. 

The term "invasive cleavage structure" refers to a cleavage structure formed by the 
hybridization of two probes to a target nucleic acid, wherein the upstream probe overlaps the 
downstream probe by at least one base. 

The term "DNA polymerase" as used herein refers to a protein that is encoded by a 
gene that is derived from the gene for a naturally occurring enzyme, said naturally occurring 
enzyme having nucleic acid synthetic activity. In some embodiments, the gene for the DNA 
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polymerase is altered relative to the gene for the naturally occurring enzyme, such that the 
DNA polymerase lacks or has reduced synthetic activity. 

The term "synthetic activity 11 refers to the ability of an enzyme to catalyze DNA 
synthesis by addition of deoxyribonucleotide or deoxyribonucletide analog units to a DNA 
chain using DNA or RNA as a template. 

The term "target-complementary" refers to feature of a nucleic acid or a portion of an 
nucleic acid, said feature being that it is complementary to a target nucleic acid. 

The terms "RNA-dependent" and "DNA-dependent", when used with reference to an 
enzymatic activity, refer to activity that occurs in response to the presence of an RNA or 
DNA target strand, respectively. A single enzyme may possess both types of activities and 
the presence of one type of activity does not imply the absence of any other activity. For 
example, description of an enzyme as having RNA-dependent 5 J nuclease activity is not 
meant to indicate the absence of a DNA-dependent 5' nuclease activity, or any other activity. 
Similarly, an enzyme having a DNA-dependent activity may also have RNA-dependent and 
other activities. 

The terms "target" and "template" are used interchangeably, and refer to a nucleic acid 
to be detected or analyzed. In some embodiments, the target is a nucleic acid to which one or 
more oligonucleotides or probes are hybridized. In a cleavage embodiment, one or more 
oligonucleotides may hybridize to form a cleavage structure, the formation of which may be 
used for detection or analysis of said target nucleic acid. The target nucleic acids include but 
are not limited to, single and double stranded DNA or RNA, modified nucleic acids (e.g., 
methylated nucleic acids), chimeric nucleic acids, peptide nucleic acids and the like. 
"Nucleic acid molecule" refers to any nucleic acid containing molecule. The term 
encompasses sequences that include base analogs of DNA and RNA including, but not limited 
to, 4-acetylcytosine, 8-hydroxy-N6-methyladenosine, aziridinylcytosine, pseudoisocytosine, 
5-(carboxyhydroxylmethyl) uracil, 5-fluorouracil, 5-bromouracil, 

5-carboxymethylaminomethyl-2-thiouracil, 5-carboxymethylaminomethyluracil, dihydrouracil, 
inosine, N6-isopentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 
1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 
5-methylcytosine, N6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 
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5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 
5'-methoxycarbony]methyluracil, 5-methoxyuracil, 2-methyltbio-N6-isopentenyladenine, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
N-uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, pseudouracil, queosine, 
2-thiocytosine, and 2,6-diaminopurine. 

The terra "overlap" as used herein in reference to cleavage structures, refers to a 
feature of an invasive cleavage structure, said feature being that the 3' end of an upstream 
INVADER oligonucleotide overlaps with the hybridized region of the signal probe by at least 
one base. An overlap may be created by a duplication of sequence between the 3' portion of 
the upstream INVADER oligonucleotide and the 5' portion of the target-complementary 
region of the downstream probe oligonucleotide. The region of sequence so duplicated may 
be as small as a single base. Regardless of the length of the duplicated sequence (i.e., the 
overlap) the 3' terminal base of the upstream INVADER oligonucleotide need not be 
complementary to the target strand, and may be any nucleotide. In some embodiments, this 
terminal nucleotide may be replaced by a moiety having chemical features similar to a 
nucleotide such as a nucleotide analog or an aromatic ring compound. Indeed, any chemical 
moiety that provides an overlap and promotes efficient invasive cleavage is contemplated. In 
an alternative embodiment, the overlap need not involve any duplication of sequence between 
the target-complementary regions of the two probes (Lyamichev et al, Nat. Biotechnol., 
17:292 [1999], U.S. Patent 5,985,557). In this embodiment, the INVADER and signal probes 
have regions complementary to adjacent regions of the target that are contiguous and that do 
not overlap. When no sequence is shared, the 3' end of the upstream INVADER 
oligonucleotide includes at least one additional nucleotide or nucleotide-like analog that is not 
complementary to the target strand. This can be referred to as a physical overlap, in contrast 
to a sequence overlap. An overlap of either type will satisfy the requirement for overlap of 
the invasive cleavage of the INVADER assay. 

As used herein, the terms "N-terminal" and "C-terminal" in reference to polypeptide 
sequences refer to regions of polypeptides including portions of the N-terminal and C-terminal 
regions of the polypeptide, respectively. A sequence that includes a portion of the N-terminal 
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region of polypeptide includes amino acids predominantly from the N-terminal half of the 
polypeptide chain, but is not limited to such sequences. For example, an N-terminal sequence 
may include an interior portion of the polypeptide sequence including bases from both the N- 
terminal and C-terminal halves of the polypeptide. The same applies to C-terminal regions. 
One example of such a sequence if shown in Figure 6, wherein the "polymerase domain" may 
be referred to a C-terminal region of the polypeptide even though it contains amino acids that 
are part of the N-terminal half of the polypeptide. N-terminal and C-terminal regions may, 
but need not, include the amino acid defining the ultimate N-terminal and C-terminal ends of 
the polypeptide, respectively. 

As used herein, the terms "complementary" or "complementarity" are used in reference 
to polynucleotides (i.e., a sequence of nucleotides such as an oligonucleotide or a target 
nucleic acid) related by the base-pairing rules. For example, for the sequence "A-G-T," is 
complementary to the sequence "T-C-A." Complementarity may be "partial," in which only 
some of the nucleic acids' bases are matched according to the base pairing rules. Or, there 
may be "complete" or "total" complementarity between the nucleic acids. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency and 
strength of hybridization between nucleic acid strands. This is of particular importance in 
amplification reactions, as well as detection methods which depend upon binding between 
nucleic acids. 

The term "homology" refers to a degree of identity. There may be partial homology 
or complete homology. A partially identical sequence is one that is less than 100% identical 
to another sequence. 

As used herein, the term "hybridization" is used in reference to the pairing of 
complementary nucleic acids. Hybridization and the strength of hybridization (i.e., the 
strength of the association between the nucleic acids) is impacted by such factors as the 
degree of complementary between the nucleic acids, stringency of the conditions involved, the 
T m of the formed hybrid, and the G:C ratio within the nucleic acids. 

As used herein, the term "T m " is used in reference to the "melting temperature." The 
melting temperature is the temperature at which a population of double-stranded nucleic acid 
molecules becomes half dissociated into single strands. The equation for calculating the T m of 
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nucleic acids is well known in the art. As indicated by standard references, a simple estimate 
of the T m value may be calculated by the equation: T m = 81.5 + 0.41(% G + C), when a 
nucleic acid is in aqueous solution at 1 M NaCl (See e.g., Anderson and Young, Quantitative 
Filter Hybridization, in Nucleic Acid Hybridization (1985). Other references include more 
sophisticated computations which take structural as well as sequence characteristics into 
account for the calculation of T m . 

As used herein the term "stringency" is used in reference to the conditions of 
temperature, ionic strength, and the presence of other compounds, under which nucleic acid 
hybridizations are conducted. With "high stringency" conditions, nucleic acid base pairing 
will occur only between nucleic acid fragments that have a high frequency of complementary 
base sequences. Thus, conditions of "weak" or "low" stringency are often required when it is 
desired that nucleic acids which are not completely complementary to one another be 
hybridized or annealed together. 

The term "gene" refers to a DNA sequence that comprises control and coding 
sequences necessary for the production of a polypeptide or precursor. The polypeptide can be 
encoded by a full length coding sequence or by any portion of the coding sequence so long as 
the desired enzymatic activity is retained. 

The term "wild-type" refers to a gene or gene product which has the characteristics of 
that gene or gene product when isolated from a naturally occurring source. A wild-type gene 
is that which is most frequently observed in a population and is thus arbitrarily designed the 
"normal" or "wild-type" form of the gene. In contrast, the term "modified" or "mutant" refers 
to a gene or gene product which displays modifications in sequence and or functional 
properties (i.e., altered characteristics) when compared to the wild-type gene or gene product. 
It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that 
they have altered characteristics when compared to the wild-type gene or gene product. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired coding sequence and appropriate DNA sequences necessary for the 
expression of the operably linked coding sequence in a particular host organism. DNA 
sequences necessary for expression in prokaryotes include a promoter, optionally an operator 
sequence, a ribosome binding site and possibly other sequences. Eukaryotic cells are known 
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to utilize promoters, polyadenlyation signals and enhancers. 

The term "oligonucleotide" as used herein is defined as a molecule comprising two or 
more deoxyribonucleotides or ribonucleotides, preferably at least 5 nucleotides, more 
preferably at least about 10-15 nucleotides and more preferably at least about 15 to 30 
nucleotides. The exact size will depend on many factors, which in turn depends on the 
ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in any 
manner, including chemical synthesis, DNA replication, reverse transcription, or a 
combination thereof. 

Because mononucleotides are reacted to make oligonucleotides in a manner such that 
the 5 J phosphate of one mononucleotide pentose ring is attached to the 3 5 oxygen of its 
neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5' end" if its 5' phosphate is not linked to the 3' oxygen of a 
mononucleotide pentose ring and as the "3' end" if its 3' oxygen is not linked to a 5' 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5' and 3 5 ends. 
A first region along a nucleic acid strand is said to be upstream of another region if the 3' 
end of the first region is before the 5' end of the second region when moving along a strand 
of nucleic acid in a 5 5 to 3' direction. 

When two different, non-overlapping oligonucleotides anneal to different regions of 
the same linear complementary nucleic acid sequence, and the 3' end of one oligonucleotide 
points towards the 5' end of the other, the former may be called the "upstream" 
oligonucleotide and the latter the "downstream" oligonucleotide. 

The term "primer" refers to an oligonucleotide which is capable of acting as a point of 
initiation of synthesis when placed under conditions in which primer extension is initiated. 
An oligonucleotide "primer" may occur naturally, as in a purified restriction digest or may be 
produced synthetically. 

A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize with a 
template strand for primer elongation to occur. A primer sequence need not reflect the exact 
sequence of the template. For example, a non-complementary nucleotide fragment may be 
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attached to the 5' end of the primer, with the remainder of the primer sequence being 
substantially complementary to the strand. Non-complementary bases or longer sequences can 
be interspersed into the primer, provided that the primer sequence has sufficient 
complementarity with the sequence of the template to hybridize and thereby form a template 
primer complex for synthesis of the extension product of the primer. 

"Hybridization" methods involve the annealing of a complementary sequence to the 
target nucleic acid (the sequence to be detected; the detection of this sequence may be by 
either direct or indirect means). The ability of two polymers of nucleic acid containing 
complementary sequences to find each other and anneal through base pairing interaction is a 
well-recognized phenomenon. The initial observations of the "hybridization" process by 
Marmur and Lane, Proa Natl Acad. Sci. USA 46:453 (1960) and Doty et al. 9 Proc. Natl 
Acad. Sci USA 46:461 (1960) have been followed by the refinement of this process into an 
essential tool of modern biology. 

With regard to complementarity, it is important for some diagnostic applications to 
determine whether the hybridization represents complete or partial complementarity. For 
example, where it is desired to detect simply the presence or absence of pathogen DNA (such 
as from a virus, bacterium, fungi, mycoplasma, protozoan) it is only important that the 
hybridization method ensures hybridization when the relevant sequence is present; conditions 
can be selected where both partially complementary probes and completely complementary 
probes will hybridize. Other diagnostic applications, however, may require that the 
hybridization method distinguish between partial and complete complementarity. It may be of 
interest to detect genetic polymorphisms. For example, human hemoglobin is composed, in 
part, of four polypeptide chains. Two of these chains are identical chains of 141 amino acids 
(alpha chains) and two of these chains are identical chains of 146 amino acids (beta chains). 
The gene encoding the beta chain is known to exhibit polymorphism. The normal allele 
encodes a beta chain having glutamic acid at the sixth position. The mutant allele encodes a 
beta chain having valine at the sixth position. This difference in amino acids has a profound 
(most profound when the individual is homozygous for the mutant allele) physiological impact 
known clinically as sickle cell anemia. It is well known that the genetic basis of the amino 
acid change involves a single base difference between the normal allele DNA sequence and 
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the mutant allele DNA sequence. 

The complement of a nucleic acid sequence as used herein refers to an oligonucleotide 
which, when aligned with the nucleic acid sequence such that the 5' end of one sequence is 
paired with the 3' end of the other, is in "antiparallel association." Certain bases not 
commonly found in natural nucleic acids may be included in the nucleic acids of the present 
invention and include, for example, inosine and 7-deazaguanine. Complementarity need not 
be perfect; stable duplexes may contain mismatched base pairs or unmatched bases. Those 
skilled in the art of nucleic acid technology can determine duplex stability empirically 
considering a number of variables including, for example, the length of the oligonucleotide, 
base composition and sequence of the oligonucleotide, ionic strength and incidence of 
mismatched base pairs. 

The term "label" as used herein refers to any atom or molecule which can be used to 
provide a detectable (preferably quantifiable) signal, and which can be attached to a nucleic 
acid or protein. Labels may provide signals detectable by fluorescence, radioactivity, 
colorimetry, gravimetry, X-ray diffraction or absorption, magnetism, enzymatic activity, and 
the like. A label may be a charged moiety (positive or negative charge) or alternatively, may 
be charge neutral. 

The term "cleavage structure" as used herein, refers to a structure which is formed by 
the interaction of at least one probe oligonucleotide and a target nucleic acid to form a 
complex having at least one region of base pairing before the probe and target, the resulting 
structure being cleavable by a cleavage agent, including but not limited to an enzyme. The 
cleavage structure is a substrate for specific cleavage by the cleavage agent, in contrast to a 
nucleic acid molecule which is a substrate for non-specific cleavage by agents such as 
phosphodiesterases which cleave nucleic acid molecules without regard to secondary structure 
(i.e., no formation of a duplexed structure is required). 

The terms "cleavage means" and "cleavage agent" as used herein refer to any agent 
which is capable of cleaving a cleavage structure, including but not limited to enzymes (e.g., 
polymerases and flap endonucleases). The cleavage means may include native polymerases 
having 5' nuclease activity (e.g., Taq DNA polymerase, E. coli DNA polymerase I) and, more 
specifically, modified polymerases having 5' nuclease but lacking synthetic activity. 
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"Structure-specific nucleases" or "structure-specific enzymes" are enzymes which recognize 
specific secondary structures in a nucleic molecule and cleave these structures. The cleavage 
means of the invention cleave a nucleic acid molecule in response to the formation of 
cleavage structures; it is not necessary that the cleavage means cleave the cleavage structure at 
any particular location within the cleavage structure. 

The cleavage means is not restricted to enzymes having solely 5 ? nuclease activity. 
The cleavage means may include nuclease activity provided from a variety of sources 
including, but not limited to, the CLEAVASE enzymes, the FEN-1 endonucleases (including 
RAD2 and XPG proteins), Taq DNA polymerase and E. coli DNA polymerase L 

The term "thermostable" when used in reference to an enzyme, such as a 5' nuclease, 
indicates that the enzyme is functional or active {i.e., can perform catalysis) at an elevated 
temperature, e.g., at about 55°C or higher. 

The term "cleavage products" as used herein, refers to products generated by the 
reaction of a cleavage means with a cleavage structure {i.e., the treatment of a cleavage 
structure with a cleavage means). 

The term "non-target cleavage product" refers to a product of a cleavage reaction 
which is not derived from the target nucleic acid. As discussed above, in the methods of the 
present invention, cleavage of the cleavage structure occurs within the probe oligonucleotide. 
The fragments of the probe oligonucleotide generated by this target nucleic acid-dependent 
cleavage are "non-target cleavage products." 

The term "substantially single- stranded" when used in reference to a nucleic acid 
substrate means that the substrate molecule exists primarily as a single strand of nucleic acid 
in contrast to a double-stranded substrate which exists as two strands of nucleic acid which 
are held together by inter-strand base pairing interactions. 

The term "sequence variation" as used herein refers to differences in nucleic acid 
sequence between two nucleic acids. For example, a wild-type structural gene and a mutant 
form of this wild-type structural gene may vary in sequence by the presence of single base 
substitutions and/or deletions or insertions of one or more nucleotides. These two forms of 
the structural gene are said to vary in sequence from one another. A second mutant form of 
the structural gene may exist. This second mutant form is said to vary in sequence from both 
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the wild-type gene and the first mutant form of the gene. 

The term "liberating" as used herein refers to the release of a nucleic acid fragment 

from a larger nucleic acid fragment, such as an oligonucleotide, by the action of a 5' nuclease 

such that the released fragment is no longer covalently attached to the remainder of the 
5 oligonucleotide. 

The term "K^" as used herein refers to the Michaelis-Menten constant for an enzyme 

and is defined as the concentration of the specific substrate at which a given enzyme yields 

one-half its maximum velocity in an enzyme catalyzed reaction. 

The term "nucleotide analog" as used herein refers to modified or non-naturally 
10 occurring nucleotides such as 7-deaza purines {i.e., 7-deaza-dATP and 7-deaza-dGTP). 

Nucleotide analogs include base analogs and comprise modified forms of 

deoxyribonucleotides as well as ribonucleotides, 
yg The term "polymorphic locus" is a locus present in a population which shows variation 

r\ between members of the population {i.e., the most common allele has a frequency of less than 
'115 0.95). In contrast, a "monomorphic locus" is a genetic locus at little or no variations seen 
□ between members of the population (generally taken to be a locus at which the most common 
* allele exceeds a frequency of 0.95 in the gene pool of the population). 
O The term "microorganism" as used herein means an organism too small to be observed 

m with the unaided eye and includes, but is not limited to bacteria, virus, protozoans, fungi, and 
JH) ciliates. 

O The term "microbial gene sequences" refers to gene sequences derived from a 

microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
archaebacterial species. 

25 The term "virus" refers to obligate, ultramicroscopic, intracellular parasites incapable 

of autonomous replication {i.e., replication requires the use of the host cell's machinery). 

The term "sample" in the present specification and claims is used in its broadest sense. 
On the one hand it is meant to include a specimen or culture {e.g., microbiological cultures or 
cultured eukaryotic tissue cells). On the other hand, it is meant to include both biological and 

30 environmental samples. 
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Biological samples may be animal, including human, fluid, solid (e.g., stool) or tissue, 
as well as liquid and solid food and feed products and ingredients such as dairy items, 
vegetables, meat and meat by-products, and waste. Biological samples may be obtained from 
all of the various families of domestic animals, as well as feral or wild animals, including, but 
5 not limited to, such animals as ungulates, bear, fish, lagamorphs, rodents, etc. 

Environmental samples include environmental material such as surface matter, soil, 
water and industrial samples, as well as samples obtained from food and dairy processing 
instruments, apparatus, equipment, utensils, disposable and non-disposable items. These 
examples are not to be construed as limiting the sample types applicable to the present 
10 invention. 

The term "source of target nucleic acid" refers to any sample which contains nucleic 
acids (RNA or DNA). Particularly preferred sources of target nucleic acids are biological 
yS samples including, but not limited to cultured cells, blood, saliva, cerebral spinal fluid, pleural 
'f\ fluid, milk, lymph, sputum, and semen. 

H5 An oligonucleotide is said to be present in "excess" relative to another oligonucleotide 

□ (or target nucleic acid sequence) if that oligonucleotide is present at a higher molar 

^ concentration relative to that of the other oligonucleotide (or target nucleic acid sequence). 
O Typically, when present in excess, the probe oligonucleotide will be present at least a 100-fold 
riji molar excess. 

2|0 A sample "suspected of containing" a first and a second target nucleic acid may 

O contain either, both or neither target nucleic acid molecule. 

The terms "polymerization means" or "polymerization agent" refer to any agent 
capable of facilitating the addition of nucleoside triphosphates to an oligonucleotide. 

The terms "ligation means" or "ligation agent" refer to any agent capable of facilitating 
25 the ligation (i.e., the formation of a phosphodiester bond between a 3 5 -OH and a 5'-P located 
at the termini of two strands of nucleic acid). Preferred ligation means comprise DNA ligases 
and RNA ligases. 

The term "reactant" is used herein in its broadest sense. In some embodiments, the 
reactant can comprise an enzymatic reactant, a chemical reactant or ultraviolet light 
30 (ultraviolet light, particularly short wavelength ultraviolet light is known to break 
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oligonucleotide chains). Any agent capable of reacting with an oligonucleotide to either 
shorten {i.e., cleave) or elongate the oligonucleotide is encompassed within the term 
"reactant." 

The terms "adduct" and "moiety" are used interchangeably, are used herein in their 
broadest sense to indicate any compound or element which can be added to an 
oligonucleotide. An adduct may be charged (positively or negatively) or may be charge 
neutral. An adduct may be added to the oligonucleotide via covalent or non-covalent 
linkages. Examples of adducts, include but are not limited to indodicarbocyanine dye 
amidites, amino-substituted nucleotides, ethidium bromide, ethidium homodimer, 
(l,3-propanediamino)propidium, (diethylenetriamino)propidium, thiazole orange, 
(N-N'-tetramethyl-l,3-propanediamino)propyl thiazole orange, 

(K-N ? -tetramethyl-l,2-ethanediamino)propyl thiazole orange, thiazole orange-thiazole orange 
homodimer (TOTO), thiazole orange-thiazole blue heterodimer (TOTAB), thiazole 
orange-ethidium heterodimer 1 (TOED1), thiazole orange-ethidium heterodimer 2 (TOED2) 
and fluorescein-ethidium heterodimer (FED), psoralens, biotin, streptavidin, avidin, etc. 

The term "recombinant DNA molecule" as used herein refers to a DNA molecule that 
comprises of segments of DNA joined together by means of molecular biological techniques. 

The term "recombinant protein" or "recombinant polypeptide" as used herein refers to 
a protein molecule that is expressed from a recombinant DNA molecule. 

As used herein the term "portion" when in reference to a protein (as in "a portion of a 
given protein") refers to fragments of that protein. The fragments may range in size from 
four amino acid residues to the entire amino acid sequence minus one amino acid. 

"Nucleic acid sequence" as used herein refers to an oligonucleotide, nucleotide or 
polynucleotide, and fragments or portions thereof, and to DNA or RNA of genomic or 
synthetic origin which may be single- or double-stranded, and represent the sense or antisense 
strand. Similarly, "amino acid sequence" as used herein refers to peptide or protein sequence. 

"Peptide nucleic acid" ("PNA") as used herein refers to a molecule which comprises an 
oligomer to which an amino acid residue, such as lysine, and an amino group have been 
added. These small molecules, also designated anti-gene agents, stop transcript elongation by 
binding to their complementary strand of nucleic acid (Nielsen et al 9 Anticancer Drug Des. 
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8:53 [1993]). 

As used herein, the terms "purified" or "substantially purified" refer to molecules, 
either nucleic or amino acid sequences, that are removed from an environment, isolated or 
separated, and are at least 60% free, preferably 75% free, and most preferably 90% free from 
5 other components with which they were associated in the starting environment. The starting 
environment may be a natural one, as in the isolation of a non-recombinant protein, or it may 
be a created environment, as is the isolation of a recombinant protein from a host cell. For 
example, recombinant CLEAVASE nucleases may be expressed in bacterial host cells and the 
nucleases may be purified by the removal of host cell proteins; the percent of these 
10 recombinant nucleases is thereby increased in the sample. An "isolated polynucleotide" or 
"isolated oligonucleotide" is therefore a substantially purified polynucleotide. 

As used herein, the term "fusion protein" refers to a chimeric protein containing the 
yfl protein of interest joined to an exogenous protein fragment. The fusion partner may enhance 
solubility of recombinant chimeric protein as expressed in a host cell, may provide an affinity 
T ll5 tag (e.g., a his-tag) to allow purification of the recombinant fusion protein from the host cell 
r 3 or culture supernatant, or both. If desired, the fusion protein may be removed from the 
^ protein of interest by a variety of enzymatic or chemical means known to the art. 
O As used herein, the terms "chimeric protein" and "chimerical protein" refer to a single 

fy protein molecule that comprises amino acid sequence portions derived from two or more 
jlp parent proteins. These parent molecules may be similar proteins from genetically distinct 
£3 origins, different proteins from a single organism, or dissimilar proteins from different 

organisms. By way of example but not by way of limitation, a chimeric structure-specific 
nuclease of the present invention may contain a mixture of amino acid sequences that have 
been derived from DNA polymerase genes from two or more of the organisms having such 
25 genes, combined to form a non-naturally occurring 5' nuclease. The term "chimerical" as 
used herein is not intended to convey any particular proportion of contribution from the 
naturally occurring genes, nor limit the manner in which the portions are combined. Any 
chimeric structure-specific nuclease constructs having cleavage activity as determined by the 
testing methods described herein, for example, are improved cleavage agents within the scope 
30 of the present invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

The INVADER invasive cleavage reaction has been shown to be useful in the 
detection of RNA target strands (See e.g., U.S. Patent 6,001,567, incorporated herein by 
reference in its entirety). As with the INVADER assay for the detection of DNA (Lyamichev 
5 et aL 9 Nat. Biotechnol, 17:292 [1999]), the reactions may be run under conditions that permit 
the cleavage of many copies of a probe for each copy of the target RNA present in the 
reaction. In one embodiment, the reaction may be performed at a temperature close to the 
melting temperature (TJ of the probe that is cleaved, such that the cleaved and uncleaved 
probes readily cycle on and off the target strand without temperature cycling. Each time a 
10 full-length probe binds to the target in the presence of the INVADER oligonucleotide, it may 
be cleaved by a 5 5 nuclease enzyme, resulting in an accumulation of cleavage product. The 
accumulation is highly specific for the sequence being detected, and may be configured to be 
yjj proportional to both time and target concentration of the reaction. In another embodiment, 

the temperature of the reaction may be shifted (i.e., it may be raised to a temperature that will 
cause the probe to dissociate) then lowered to a temperature at which a new copy of the probe 
□ hybridizes to the target and is cleaved by the enzyme. In a further embodiment, the process 
2 r of raising and lowering the temperature is repeated many times, or cycled, as it is in PCR 
D (Mullis and Faloona, Methods in Enzymology, 155:335 [1987], Saiki et al 9 Science 230:1350 
ffi [1985]). 

W As noted above, 5' nucleases of Pol A type DNA polymerases are preferred for 

Q cleavage of an invasive cleavage structure that comprises an RNA target strand. The present 
invention provides enzymes having improved performance in detection assays based on the 
cleavage of a structure comprising RNA. In particular, the altered polymerases of the present 
invention exhibit improved performance in detection assays based on the cleavage of a DNA 
25 member of an invasive cleavage structure that comprises an RNA target strand. 

The 5 5 nucleases of the present invention may be derived from Pol A type DNA 
polymerases. The terminology used in describing the alterations made in this class of 5' 
nucleases relates to the descriptions of DNA polymerase structures known in the art. The 
Klenow fragment of the Pol A polymerase from E. coli (the C-terminal two thirds, which has 
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the DNA synthesizing activity but lacks the 5' nuclease activity) has been described as having 
a physical form resembling a right hand, having an open region called the "palm", and a cleft 
that holds the primer/template duplex defined on one side by a "fingers" domain and on the 
other by a "thumb" domain (Joyce and Steitz, Trends in Biochemical Science 12:288 [1987]). 
5 This is shown schematically in Figure 5. Because this physical form has proved to be 

common to all Pol A DNA polymerases and to a number of additional template-dependent 
polymerizing enzymes such as reverse transcriptases, the hand terminology has become known 
in the art, and the sites of activity in these enzymes are often described by reference to their 
position on the hand. For reference, and not intended as a limitation on the present invention, 
10 the palm is created from roughly the first 200 amino acids of the polymerase domain, the 

thumb from the middle 140, and the fingers by the next 160, with the base of the cleft formed 
from the remaining regions (Figures 6). Although some enzymes may deviate from these 

y| structural descriptions, the equivalent domains and sequences corresponding to such domains 
may be identified by sequence homology to known enzyme sequences, by comparison of 

^5 enzyme crystal structures, and other like methods, 

g In creating the improved enzymes of the present invention, several approaches have 

^ been taken, although the present invention in not limited to any particular approach. First two 
O DNA polymerases, Taq and Tth, that have different rates of DNA strand cleavage activity on 

RNA targets were compared. To identify domains related to the differences in activity, a 
if 0 series of chimerical constructs was created and the activities were measured. This process 
□ identified two regions of the Tth polymerase that could, if transferred into the Taq 

polymerase, confer on the TaqPol an RNA-dependent cleavage activity equivalent to that of 
the native Tth protein. Once these regions were identified, the particular amino acids 
involved in the activity were examined. Since the two proteins are about 87 percent identical 
25 in amino acid sequence overall, the identified regions had only a small number of amino acid 
differences. By altering these amino acids singly and in combinations, a pair of amino acids 
were identified in TthPol that, if introduced into the TaqPol protein, increased the rate of 
cleavage up to that of the native TthPol. 

These data demonstrate two important aspects of the present invention. First, specific 
30 amino acids can be changed to confer TthPol-like RNA-dependent cleavage activity on a 
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polymerase having a lesser activity. More broadly, however, these results provide regions of 
these polymerases that are involved in the recognition of the RNA-containing cleavage 
structure. Identification of these important regions, combined with published information on 
the relationships of other amino acids to the various functions of these DNA polymerases and 
computer-assisted molecular modeling during the development of the present invention have 
allowed a rational design approach to create additional improved 5' nucleases. The 
information also allowed a focused random mutagenesis approach coupled with a rapid 
screening procedure to quickly create and identify enzymes having improved properties. 
Using these methods of the present invention, a wide array of improved polymerases are 
provided. 

The methods used in creating and selecting the improved 5' nucleases of the present 
invention are described in detail below and in the experimental examples. A general 
procedure for screening and characterizing the cleavage activity of any 5' nuclease is included 
in the experimental examples. The methods discussions are divided into the following 
sections: I) Creation and selection of chimerical constructs; II) Site-specific mutagenesis 
based on information from chimerical constructs; III) Site-specific mutagenesis based on 
molecular modeling and published physical studies; and IV) focused random mutagenesis. 

I) Creation and selection of chimerical constructs 

The PolA-type DNA polymerases, including but not limited to DNA polymerase 
enzymes from Thermus species, comprise two distinctive domains, the 5' nuclease and the 
polymerase domains, shown schematically in Figure 6. The polymerase domains reside in the 
C-terminal two-thirds of the proteins and are responsible for both DNA-dependent and 
RNA-dependent DNA polymerase activities. The N-terminal one-third portions contain the 5' 
nuclease domains. In the genus Thermus Pol A polymerase, the palm region consists of, 
roughly, amino acids 300-500, the thumb region includes amino acids 500-650, while the 
fingers region is formed by the remaining amino acids from 650 to 830 (Figure 6). 

The derivatives, Taq DN RX HT and Tth DN RX HT, of Taq and TthPol used in 
many of the experiments of the present invention, and described herein, are modified to 
reduce synthetic activity and to facilitate chimera construction, but have 5' nuclease activity 
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essentially identical to unmodified TaqPol and TthPol. Unless otherwise specified, the TaqPol 
and TthPol enzymes of the following discussion refer to the DN RX HT derivative. 

TthPol has a 4- fold higher cleavage rate with the IL-6 RNA template (shown in Figure 
10) than TaqPol (shown in Figures 11 and 12), although the Taq and TthPols show 
5 similarities of cleavage in DNA target structures (Figure 10). Since the amino acid sequences 
of TaqPol and TthPol (Figures 8 and 9) share about 87% identity and greater than 92% 
similarity, the high degree of homology between the enzymes allowed creation of a series of 
chimeric enzymes between TthPol and TaqPol. The activity of the chimeric enzymes was 
used as a parameter to identify the region(s) of these proteins affecting RNA dependent 5' 
10 nuclease activity. 

The chimeric constructs between TthPol and TaqPol genes shown schematically in 
Figures 7 and 19 were created by swapping DNA fragments defined by the restriction 
y| endonuclease sites, EcoRI and BamHI, common for both genes, the cloning vector site Sail 
and the new sites, NotI, BstBI and Ndel, created at the homologous positions of both genes 
Hi5 by site directed mutagenesis. The restriction enzymes have been abbreviated as follows: 
q EcoRI is E; NotI is N; BstBI is Bs; Ndel is D, BamHI is B, and Sail is S. 
^ The activity of each chimeric enzyme was evaluated using the invasive signal 

CI amplification assay with the IL-6 RNA target (Figure 10), and the cycling cleavage rates 
fti shown in Figure 12 were determined as described in the Experimental Examples. Comparison 
jf 0 of the cleavage rates of the first two chimeras, TaqTth(N) and TthTaq(N), created by 
p swapping the polymerase and 5' nuclease domains at the NotI site (Figure 7), shows that 
TaqTth(N) has the same activity as TthPol, whereas its counterpart TthTaq(N) retains the 
activity of TaqPol (Figure 12). This result indicates that the higher cleavage rate of TthPol is 
associated with its polymerase domain and suggests an important role of the polymerase 
25 domain in the 5' nuclease activity. 

The next step was to identify a minimal region of TthPol polymerase that would give 
rise to the TthPol-like RNA dependent 5' nuclease activity when substituted for the 
corresponding region of the TaqPol sequence. To this end, the TaqTth(N) chimera was 
selected to generate a series of new constructs by replacing its TthPol sequence with 
30 homologous regions of TaqPol. First, the N-terminal and C-terminal parts of the TaqPol 
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polymerase domain were substituted for the corresponding regions of TaqTth(N) using the 
common BamHI site as a breaking point to create TaqTth(N-B) and TaqTth(B-S) chimeras, 
respectively (Figure 7). TaqTth(N-B) which has the TthPol sequence between amino acids 
328 and 593, is approximately 3 times more active than the TaqTth(B-S) and 40% more 
active than TthPol (Figure 12). This result establishes that the Notl-BamHI portion of the 
TthPol polymerase domain determines superior RNA-dependent 5' nuclease activity of 
TthPol. 

From these data it was determined that a central portion of the TthPol, when used to 
replace the homologous portion of TaqPol (TaqTth(N-B) construct) could confer superior 
RNA recognition on the chimerical protein composed primarily of Taq protein. In fact, the 
cycling rate of this chimerical protein exceeded that of the parent TthPol Comparison of 
chimeras that included sub-portions of the activity-improving region of TthPol, approximately 
50% of the region in each case (See, TaqTth(N-D) and TaqTth(D-B), Figures 7 and 12) 
showed no significant improvement in RNA dependent activity as compared to the parent 
TaqPol. This result indicates that aspects of each half of the region are required for this 
activity. A construct having an only slightly smaller portion of the Tth insert portion 
(TaqTth(Bs-B)) showed activity that was close to that of the parent TthPol protein, but which 
was less than that of the TaqTth(N-B) construct. 

H) Site-specific mutagenesis based on information from chimerical constructs 

Comparison of the TthPol and TaqPol amino acid sequences between the BstBI and 
BamHI sites reveals only 25 differences (Figure 13). Among those, there are 12 conservative 
changes and 13 substitutions resulting in a change in charge. Since the analysis of the 
chimeric enzymes has suggested that some critical amino acid changes are located in both 
BstBI-Ndel and Ndel-BamHI regions of TthPol, site directed mutagenesis was used to 
introduce the TthPol specific amino acids into the BstBI-Ndel and Ndel-BamHI regions of the 
TaqTth(D-B) and TaqTth (N-D) chimeras, respectively. Six TthPol-specific substitutions 
were generated in the BstBI-Ndel region of the TaqTth(D-B) by single or double amino acid 
mutagenesis and only one double mutation, W417L/G418K, was able to restore the TthPol 
activity with the IL-6 RNA target (See e.g., Figure 14). Similarly, 12 TthPol specific amino 
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acids were introduced at the homologous positions of the Ndel-BamHI region of the 
TaqTth(N-D) and only one of them, E507Q, increased the cleavage rate to the TthPol level 
(See e.g., Figure 14), 

To confirm that the W417L, G418K and E507Q substitutions are sufficient to increase 
the TaqPol activity to the TthPol level, TaqPol variants carrying these mutations were created 
and their cleavage rates with the IL-6 RNA substrate were compared with that of TthPol 
Figure 15 shows that the TaqPol W417L/G418K/E507Q and TaqPol G418K/E507Q mutants 
have 1.4 times higher activity than TthPol and more than 4 fold higher activity than TaqPol, 
whereas the TaqPol W417L/E507Q mutant has the same activity as TthPol, which is about 3 
fold higher than TaqPol These results demonstrate that K418 and Q507 of TthPol are 
important amino acids in defining its superior RNA dependent 5' nuclease activity compared 
to TaqPol 

The ability of these amino acids to improve the RNA dependent 5 ? nuclease activity of 
a DNA polymerase was tested by introducing the corresponding mutations into the polymerase 
A genes of two additional organisms: Thermus filiformus and Thermus scotoductus, TaqPol 
showed improved RNA dependent activity when it was modified to contain the W417L and 
E507Q mutations, which made it more similar at these residues to the corresponding residues 
of TthPol (K418 and Q507). The TfiPol was modified to have P420K and E507Q, creating 
TfiDN 2M, while the TscPol was modified to have E416K and E505Q, to create TscDN 2M. 
The activity of these enzymes for cleaving various DNA and RNA containing structures was 
determined as described in Example 1, using the IdT2, IrT3, hairpin and X-structures 
diagrammed in Figures 21 and 22, with the results shown in both Figure 25 and Table 7. 
Both enzymes have much less RNA-dependent cleavage activity than either the TthPol or the 
Taq 2M enzymes. However, introduction of the mutations cited above into these polymerases 
increased the RNA dependent cleavage activity over 2 fold compared to the unmodified 
enzymes (Figure 25). These results demonstrate that transferability of improved RNA 
dependent cleavage activity into a wide range of polymerases using the methods of the present 
invention. 
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Ill) Site-specific mutagenesis based on molecular modeling and published physical 
studies 

The positions of the G418H and E507Q mutations in the crystal structure of a complex 
of the large fragment of TaqPol (Klentaql) with a primer/template DNA determined by Li et 
5 al (Li et al, Protein Sci., 7:1116 [1998]) are shown in Figure 17. The E507Q mutation is 
located at the tip of the thumb subdomain at a nearest distance of 3.8 A and 18 A from the 
backbone phosphates of the primer and template strands, respectively. The interaction 
between the thumb and the minor groove of the DNA primer/template was previously 
suggested by the co-crystal structures of Klenow fragment DNA polymerase I (Breese et al, 

10 Science 260:352 [1993]) and TaqPol (Eom et al, Nature 382:278 [1996]). Deletion of a 24 
amino acid portion of the tip of the thumb in Klenow fragment, corresponding to amino acids 
494-518 of TaqPol, reduces the DNA binding affinity by more than 100-fold (Minnick et al, 

yj J. Biol. Chem., 271:24954 [1996]). These observations are consistent with the hypothesis that 

jr :i j the thumb region, which includes the E507 residue, is involved in interactions with the 

V W upstream substrate duplex. 

g The W417L and the G418K mutations in the palm region of TaqPol (Figure 17) are 

J" located approximately 25 A from the nearest phosphates of the template and upstream strands, 
O according to the co-crystal structures of TaqPol with duplex DNA bound in the polymerizing 
fy mode (Li et al 9 Protein Sci., 7:1116 [1998], Eom et al, Nature 382:278 [1996]). The same 
jgp distance was observed between the analogous W513 and P514 amino acids of Klenow 
O fragment and the template strand of DNA bound in the editing mode (Breese et al, Science 
260:352 [1993]). Thus, no interactions between TaqPol and the overlapping substrate can be 
suggested from the available co-crystal studies for this region. 

Although an understanding of the mechanism of action of the enzymes is not necessary 
25 for the practice of the present invention and the present invention is not limited to any 

mechanism of action, it is proposed that the amino acids at positions 417 and 418 in the palm 
region of TaqPol interact with the upstream substrate duplex only when the enzyme functions 
as a 5' nuclease, but no interaction with these amino acids occurs when TaqPol switches into 
polymerizing mode. This hypothesis suggests a novel mode of substrate binding by DNA 
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polymerases called here the "5' nuclease mode." Several lines of evidence support this 
hypothesis. The study of the chimeric enzymes described here clearly separates regions of the 
polymerase domain involved in the 5' nuclease and polymerase activities. Accordingly, the 
W417L and G418K mutations, together with the E507Q mutation, affect the 5' nuclease 
activity of TaqPol on substrates having an RNA target strand (Figure 15), but have no effect 
on either RNA-dependent or DNA-dependent DNA polymerase activities (Figure 16). On the 
other hand, mutations in the active site of TaqPol, such as R573A, R587A, E615A, R746A, 
N750A and D785N, which correspond to substitutions in Klenow fragment of E.coli DNA Pol 
I that affect both polymerase activity and substrate binding affinity in the polymerizing mode 
(Polesky et al 9 J. Biol Chem., 265:14579 [1990], Polesky et al 9 J. Biol. Chem., 267:8417 
[1992], Pandey et al 9 Eur. J. Biochem., 214:59 [1993]) were shown to have little or no effect 
on the 5' nuclease activity. Superposition of the polymerase domains of TaqPol (Eom et al 9 
Nature 382:278 [1996]), E.coli Pol I and Bacillus stearothermophilus Pol I (Kiefer et al 9 
Nature 391:304 [1998]) using the programs DALI (Holm and Sander, J. Mol. Biol., 233:123 
[1993], Holm and Sander, Science 273:595 [1996]) and Insight II (Molecular Simulation Inc., 
Naperville, IL) shows that the palm region of TaqPol between amino acids 402-45 1, including 
W417 and G418, is structurally highly conserved between the three polymerases, although 
there is no structural similarity between the rest of the palm subdomains. This observation 
suggests an important role for this region in eubacterial DNA polymerases. 

The 5' nuclease and polymerase activities should be precisely synchronized to create a 
nicked structure rather than a gap or an overhang that could cause a deletion or an insertion 
during Okazaki fragment processing or DNA repair, if ligase joins the ends inappropriately. 
According to the previously proposed model (Kaiser et al 9 J. Biol. Chem., 274:21387 [1999]), 
the 3' terminal nucleotide of the upstream strand is sequestered by the 5' nuclease domain to 
prevent its extension, thus halting synthesis. The interaction with the 3 5 nucleotide apparently 
activates the 5 J nuclease that endonucleolitically removes the displaced 5' arm of the 
downstream strand. This cleavage occurs by the precise incision at the site defined by the 3' 
nucleotide, thus creating the nick. This model requires a substantial rearrangement of the 
substrate-enzyme complex, which may include a translocation of the complex to the 5 5 
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nuclease mode to separate the primer/template from the polymerase active site. 

It is possible that a relocation of the substrate away from the polymerase active site 
could be induced by the interaction between the duplex formed between the template and 
incoming strands and the crevice formed by the finger and thumb subdomains. Such an 
5 interaction could force conformational transitions in the thumb that would bring the 

template/primer duplex into close contact with the W417 and G418 amino acids. Significant 
flexibility of the thumb has been previously reported that might explain such changes (Beese 
et al, Science 260:352 [1993], Eom et al, Nature 382:278 [1996], Ollis et al, Nature 
313:762 [1985], Kim et al, Nature 376:612 [1995], Korolev et al, Proc. Natl. Acad. Sci., 
10 92:9264 [1995], Li et al, EMBO J., 17:7514 [1998]). Additional conformational changes of 
the fingers domain that might help to open the crevice, such as the transition from the 
'closed' to the 'open' structure described by Li et al (Li et al, EMBO J., 17:7514 [1998]), 
yg are consistent with this model. It may be that the 5' nuclease binding mode was not observed 
't\ in any of the published co-crystal structures of a DNA Pol I because the majority of the 
ll5 structures were solved for the polymerase domain only, with a template/primer substrate rather 
p than with an overlapping 5' nuclease substrate. 

+ values of 200-300 nM have been determined for TaqPol, TthPol and TaqPol 

O G418K/E507Q for the RNA containing substrate. These values are much higher than the 
f || value of <1 nM estimated for TthPol with an all-DNA overlapping substrate suggesting that 
j|0 the RNA template adversely affects substrate binding. The low affinity could be explained by 
O the unfavorable interaction between the enzyme and either the A- form duplex adopted by the 
substrate with an RNA target, or the ribose T hydroxy Is of the RNA strand. Between these 
two factors, the latter seems more likely, since the 5' nucleases of eubacterial DNA 
polymerases can efficiently cleave substrates with an RNA downstream probe (Lyamichev et 
25 al, Science 260:778 [1993]), which would presumably have an A-form. Further, the 

co-crystal studies suggest that the template/primer duplex partially adopts a conformation 
close to A-form in its complex with DNA polymerase (Eom et al, Nature 382:278 [1996], 
Kiefer et al, Nature 391:304 [1998], Li et al, EMBO J., 17:7514 [1998]). The 
G418K/E507Q mutations increase the k cat of TaqPol more than two fold, but have little effect 
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on Kj,,. Such an effect would be expected if the mutations position the substrate in an 
orientation more appropriate for cleavage rather than simply increasing the binding constant. 

In addition to the mutational analysis described above, another approach to studying 
specific regions of enzymes, enzyme structure-function relationships, and enzyme-substrate 
interaction is to investigate the actual, physical structure of the molecule. 

With the advances in crystallographic, NMR, and computer and software technology, 
study of molecular structure has become a viable tool for those interested in the configuration, 
organization, and dynamics of biomolecules. Molecular modeling has increased the 
understanding of the nature of the interactions that underlie the structure of proteins and how 
proteins interact functionally with substrate. Numerous publications describing the structures 
of various polymerases or polymerase protein portions, HIV reverse transcriptase, and other 
nucleic acid binding proteins have provided mechanistic insights into protein conformation, 
changes in conformation, and molecular interactions necessary for function. 

As an example, the report by Doublie et al (Doublie et al 9 Nature 391:251 [1998]) 
discloses the crystal structure of T7 DNA polymerase and provides information about which 
amino acid regions are likely to have an affect on substrate binding, which are required to 
contact the substrate for polymerization, and which amino acids bind cofactors, such as metal 
ions. It is noted in this paper and others that many of the polymerases share not only 
sequence similarity, but structural homology as well. When certain structural domains of 
different polymerases are superimposed (for example, T7 polymerase, Klenow fragment 
editing complex, the unliganded Taq DNA polymerase and the Taq Polymerase-DNA 
complex) conserved motifs are clearly discernable. 

Specifically, combining the information from all of these different structural sources 
and references, a model of the protein interacting with DNA, RNA, or heteroduplex can be 
made. The model can then be examined to identify amino acids that may be involved in 
substrate recognition or substrate contact. Changes in amino acids can be made based on 
these observations, and the effects on the various activities of the 5' nuclease proteins are 
assessed using screening methods such as those of the present invention, described in the 
experimental examples. 

The domain swapping analysis discussed previously demonstrated that sequences of 
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TthDN that are important in RNA-dependent 5' nuclease activity lie in the polymerase domain 
of the protein. Therefore, study of structural data of the polymerase domain with respect to 
nucleic acid recognition provides one method of locating amino acids that, when altered, alter 
RNA recognition in a 5' nuclease reaction. For example, analysis conducted during the 
development of the present invention examined published analyses relating to primer/template 
binding by the polymerase domain of E. coli Pol 1, the Klenow fragment. Table 1 shows a 
sampling of kinetic constants determined for the Klenow fragment, and shows the effects a 
number of mutations on these measurements. The corresponding or similarly positioned 
amino acids in the TaqPol are indicated in the right hand column. It was postulated that 
mutations having a noticible impact on the interactions of the Klenow fragment with the DNA 
template or the primer/template duplex, as indicated by changes in K d and Relative DNA 
affinity values, might also have effects when made at the corresponding sites in TaqPol and 
related chimerical or mutant derivatives. A selection of the mutations that produced a higher 
value or a lower Relative DNA affinity value when introduced into the Klenow fragment 
were created and examined in TaqPol. These Taq derivatives include, but are not limited to, 
those indicated by asterisks in the right hand column of Table 1. 

For some Klenow variants, such as the R682 mutants, selection for testing was not 
made based on the DNA affinity measurements, but because molecular modeling suggested 
interaction between some aspect of the template/primer duplex and that amino acid. 
Similarly, additional regions of Taq polymerase (or Taq derivatives) were targeted for 
mutagenesis based on structural data and information from molecular modeling. Based on 
modeling, the thumb region was postulated to contact an RNA template. Thus, amino acids 
in the thumb region were looked for that, if altered, might alter that contact. For example, 
Figures 6 and 17 show that amino acids 502, 504, and 507 are located at the tip of the thumb. 
It was postulated that altering these amino acids might have an affect on the enzyme-substrate 
interaction. Using the activity screening methods described In Example 1, mutations that 
produced beneficial effects were identified. This approach was used to create a number of 
improved enzymes. For example, TaqPol position H784, corresponding to Klenow amino 
acid H881, is an amino acid in the fingers region and, as such, may be involved in 
primer/template substrate binding. When the H881 amino acid in the Klenow enzyme is 
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replaced by alanine, the change decreases the affinity of the enzyme for DNA to only 30 to 
40% of the wild type level. An analogous substitution was tested in a TaqPol-derived 
enzyme. Starting with the Taq derivative W417L/G418K/E507Q, amino acid 784 was 
changed from Histidine (H) to Alanine (A) to yield the W417L/G418K/E507Q/H784A 
mutant, termed Taq 4M. This variant showed improved 5' nuclease activity on the RNA test 
IrTl (Figure 24) test substrate (data in Table 2). Amino acid R587 is in the thumb region, 
and was selected for mutation based on its close proximity to the primer/template duplex in 
computer models. When an R587A mutation was added to the Taq 4M variant, the activity 
on the test IrTl test substrate was still further improved. In addition, the reduction, relative to 
the 4M variant, in cleavage of the X structure shown in Figure 22 constitutes an additional 
improvement in this enzyme's function. 

Not all amino acid changes that reduce DNA binding in the polymerization affect the 
5' nuclease activity. For example, mutations E615A, R677A, affecting amino acid that are 
also in the thumb and fingers domains, respectively, have either adverse effect, or no effect on 
the 5' nuclease activities, respectively, as measured using the test substrates in Figures 21 and 
22, and compared to the parent variants that lacked these changes. The R677A mutation was 
added to, and compared with the TaqSS variant, while the E615A mutation was added to and 
compared with the Taq 4M variant. The test methods described herein provide a convenient 
means of analyzing any variant for the alterations in the cleavage activity of both invasive an 
noninvasive substrates, for both DNA and RNA containing structures. Thus, the present 
invention provides methods for identifying all suitable improved enzymes. 

Alterations that might increase the affinity of the enzymes for the nucleic acid targets 
were also examined. Many of the mutations described above were selected because they 
caused the Klenow fragment enzyme to have decreased affinity for DNA, with the goal of 
creating enzymes more accepting of structures containing non-DNA strands. In general, the 
native DNA polymerases show a lower affinity for RNA/DNA duplexes, compared to their 
affinity for DNA/DNA duplexes. During the development of the present invention, it was 
sought to increase the general affinity of the proteins of the present invention for a nucleic 
acid substrate without restoring or increasing any preference for structures having DNA rather 
than RNA target strands. The substitution of amino acids having different charges was 
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examined as a means of altering the interaction between the proteins and the nucleic acid 
substrates. For example, it was postulated that addition of positively charged amino acid 
residues, such as lysine (K), might increase the affinity of a protein for a negatively charged 
nucleic acid. 

As noted above, alterations in the thumb region could affect the interactions of the 
protein with the nucleic acid substrate. In one example, the mutation G504K (tip of the 
thumb) was introduced in Taq4M and caused and enhancement of nuclease activity by 15% 
on an RNA target. Additional positively charged mutations (A502K and E507K) further 
improve the RNA target dependent activity by 50% compared to the parent Taq4M enzyme. 

The use of data from published studies and molecular modeling, in combination with 
results accrued during the development of the present invention allowed the identification of 
regions of the proteins in which changes of amino acids would be likely to cause observable 
differences in at least one aspect of cleavage function. While regions could be targeted in this 
way, it was observed that changes in different amino acids, even if near or immediate 
neighbors in the protein, could have different effects. For example, while the A502K 
substitution created a marked increase in the RNA-dependent cleavage activity of Taq 4M, 
changing amino acid 499 from G to a K, only 3 amino acids away from 502, gave minimal 
improvement. As can be seen in the Experimental Examples, the approach of the present 
invention was to change several amino acids in a candidate region, either alone or in 
combination, then use the screening method provided in Example 1 to rapidly assess the 
effects of the changes. In this way, the rational design approach is easily applied to the task 
of protein engineering. 

In addition to the thumb, palm, and hand regions found in the polymerase domain of 
these proteins, regions that are specific to 5' nucleases and nuclease domains were examined. 
Comparative studies on a variety of 5 5 nucleases have shown that, though the amino acid 
sequences vary dramatically from enzyme to enzyme, there are structural features common to 
most. Two of these features are the helix-hairpin-helix motif (H-h-H) and the arch or loop 
structure. The H-h-H motif is believed to mediate non-sequence specific DNA binding. It 
has been found in at least 14 families of proteins, including nucleases, N-glycosylases, ligases, 
helicases, topoisomerases, and polymerases (Doherty et aL, Nucl. Acid. Res., 24:2488 [1996]). 
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The crystallographic structure of rat DNA polymerase pol p bound to a DNA template-primer 
shows non-specific hydrogen bonds between the backbone nitrogens of the pol p HhH motif 
and the phosphate oxygens of the primer of the DNA duplex (Pelletier et al 9 Science 
264:1891 [1994]). Because the HhH domain of 5' nuclease domains of Taq and Tth 
5 polymerases may function in a similar manner, it is contemplated that mutations in the HhH 
region of the enzyme alter activity. Mutations may be introduced to alter the shape and 
structure of the motif, or to change the charge of the motif to cause increased or decreased 
affinity for substrate. 

Another structure common to many 5' nucleases from diverse sources such as 

10 eukaryotes, eubacteria, archaea and phage, is the arch or loop domain. The crystal structure 
of the 5' exonuclease of bacteriophage T5 showed a distinct arch formed by two helices, one 
positively charged and one containing hydrophobic residues (Ceska et al 9 Nature 382:90 

yll [1996]). Interestingly, three residues that are conserved between T5 and Taq, Lys 83, Arg 86 
and Tyr 82 are all in the arch. These correspond to amino acids Lys 83, Arg 86, and Tyr 82 

~Tt5 in Taq DNA polymerase. The crystal structure for Taq (5 5 nuclease) has also been 

p determined (Kim et al 9 Nature 376:612 [1995]). 

" ?f; The crystal structure from the flap endonuclease-1 from Methanococcus janneschii also 

Q shows such a loop motif (Hwang et al 9 Nat. Struct. Biol, 5:707 [1998]). The backbone 
m crystal structure of Mja FEN-1 molecules may be superimposed on T5 exonuclease, Taq 
Jo 5 '-exonuclease and T4 RNase H. An interesting feature common to all of these is the long 
O loop. The loop of FEN-1 consists of a number of positively charged and aromatic residues 
and forms a hole with dimensions large enough to accommodate a single- stranded DNA 
molecule. The corresponding region in T5 exonuclease consists of three helices forming a 
helical arch. The size of the hole formed by the helical arch in T5 exonuclease is less than 
25 half that formed by the LI loop in Mj FEN-1. In T4 RNase H or Taq 5' exonuclease, this 
region is disordered. Some regions of the arch bind metals, while other regions of the arch 
contact nucleic acid substrate. Alignment of the amino-acid sequences of six 5' nuclease 
domains from DNA polymerases in the pol I family show six highly conserved sequence 
motifs containing ten conserved acidic residues (Kim et al 9 Nature 376 [1995]). 
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The effects of alterations in the arch region were examined. In Taq polymerase the 
arch region is formed by amino acids 80-95 and 96-109, Site directed mutagenesis was 
performed on the arch region. Alignment of amino acid sequences of the FEN and 
polymerase 5' nucleases suggested the design of 3 amino acid substitution mutations, P88E, 
5 P90E and G80E. These substitutions were made on the Taq4M polymerase mutant as a 
parent enzyme. Results indicated that although the background activity on the HP and X 
substrates shown in Figure 22 are tremendously suppressed in all mutants, the desirable 5' 
nuclease activity on proper substrates (IdT and IrT, Figure 24) is also reduced. Despite the 
sequence homology between Taq and Tth polymerases, they have very different activity on 
10 HP and X substrates. The alignment of the Taq and Tth polymerase arch regions also 

demonstrates regions of extensive sequence homology as well as minor differences. These 
differences led to the design of mutations L109F and A110T using Taq4M to generate Taq4M 
JJ L109F/A110T, and the mutant Taq 4M A502K/G504K/E507K/T514S to generate Taq 4M 
g L109F/A110T/A502K/G504K/E507K/T514S mutant. These two mutations have drastically 
*'i5 converted Taq4M enzyme to become more like Tth enzyme in terms of the background 
p substrate specificity while the 5 J nuclease activities on both DNA and RNA substrates are 
^ almost unchanged. 

py TV) Focused random mutagenesis 

5|0 As described above, physical studies and molecular modeling may be used alone or in 

CJ combination to identify regions of the enzymes in which changes of amino acids are likely to 
cause observable differences in at least one aspect of cleavage function. In the section above, 
use of this information was described to select and change specific amino acids or 
combinations of amino acids. Another method of generating an enzyme with altered function 

25 is to introduce mutations randomly. Such mutations can be introduced by a number of 

methods known in the art, including but not limited to, PCR amplification under conditions 
that favor nucleotide misincorporation (REF), amplification using primers having regions of 
degeneracy (i.e., base positions in which different individual, but otherwise similar 
oligonucleotides in a reaction may have different bases), and chemical synthesis. Many 

30 methods of random mutagenesis are known in the art (Del Rio et al 9 Biotechniques 17:1132 
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[1994]), and may be incorporated into the production of the enzymes of the present invention. 
The discussions of any particular means of mutagenesis contained herein are presented solely 
by way of example and not intended as a limitation. When random mutagenesis is performed 
such that only a particular region of an entire protein is varied, it can be described as "focused 
random mutagenesis." As described in the Experimental Examples, a focused random 
mutagenesis approach was applied to vary the HhH and the thumb domains some of the 
enzyme variants previously created. These domains were chosen to provide examples of this 
approach, and it is not intended that the random mutagenesis approach be limited to any 
particular domain, or to a single domain. It may be applied to any domain, or to any entire 
protein. Proteins thus modified were tested for cleavage activity in the screening reactions 
described in Example 1, using the test substrates diagrammed in Figures 22 and 24, with the 
results described in Tables 5 and 6. 

Random mutagenesis was performed on the HhH region with the parent TaqSS or 
TthDN H785A mutants. None of the 8 mutants generated showed an improvement in activity 
compared to the parent enzyme (Table 5). In fact, mutations of the region between residues 
198-205 have about 2-5 fold lower activity on both DNA and RNA substrates, suggesting that 
this region is essential for substrate recognition. Mutagenesis in the thumb region resulted in 
new mutations that improved 5' nuclease activity by 20-100% on a DNA target and about 
10% on an RNA target (Table 6). 

Numerous amino acids in each of the distinct subdomains play roles in substrate 
contact. Mutagenesis of these may alter substrate specificity by altering substrate binding. 
Moreover, mutations introduced in amino acids that do not directly contact the substrate may 
also alter substrate specificity through longer range or general conformation altering effects. 
These mutations may be introduced by any of several methods known in the art, including, 
but not limited to random mutagenesis, site directed mutagenesis, and generation of chimeric 
proteins. 

As noted above, numerous methods of random mutagenesis are known in the art. The 
methods applied in the focused random mutagenesis described herein my be applied to whole 
genes. It is also contemplated that additional useful chimerical constructs may be created 
through the use of molecular breeding {See e.g., U.S. Pat. No. 5,837,458 and PCT 
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Publications WO 00/18906, WO 99/65927, WO 98/31837, and WO 98/27230, herein 
incorporated by reference in their entireties). Regardless of the mutagenesis method chosen, 
the rapid screening method described herein provides a fast and effective means of identifying 
beneficial changes within a large collection of recombinant molecules. This makes the 
random mutagenesis procedure a manageable and practical tool for creating a large collection 
of altered 5' nucleases having beneficial improvements. The cloning and mutagenesis 
strategies employed for the enzymes used as examples are applicable to other thermostable 
and non-thermostable Type A polymerases, since DNA sequence similarity among these 
enzymes is very high. Those skilled in the art would understand that differences in sequence 
would necessitate differences in cloning strategies, for example, the use of different restriction 
endonucleases may be required to generate chimeras. Selection of existing alternative sites, or 
introduction via mutagenesis of alternative sites are well established processes and are known 
to one skilled in the art. 

Enzyme expression and purification can be accomplished by a variety of molecular 
biology methods. The examples described below teach one such method, though it is to be 
understood that the present invention is not to be limited by the method of cloning, protein 
expression, or purification. The present invention contemplates that the nucleic acid construct 
be capable of expression in a suitable host. Numerous methods are available for attaching 
various promoters and 3 5 sequences to a gene structure to achieve efficient expression. 

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof. 

In the disclosure which follows, the following abbreviations apply: Afu 
(Archaeoglobus fulgidus); Mth (Methanobacterium thermoautotrophicum); Mja 
(Methanococcus jannaschii); Pfu (Pyrococcus furiosus); Pwo (Pyrococcus woesei); Taq 
(Thermits aquaticus)\ TaqPol (T. aquaticus DNA polymerase I); StfPol (the Stoffel fragment 
of TaqPol); EclPol (E. coli DNA polymerase I); Tth (Thermus thermophilus); TthPol (T. 
thermophilus DNA polymerase I); Tsc (Thermus scotoductus); TscPol (Thermus scotoductus 
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DNA polymerase I); Tfi {Thermus filiformis ); TfiPol (Thermus filiformis DNA polymerase I); 
Ex. (Example); Fig. (Figure); °C (degrees Centigrade); g (gravitational field); vol (volume); 
w/v (weight to volume); v/v (volume to volume); BSA (bovine serum albumin); CTAB 
(cetyltrimethylammonium bromide); HPLC (high pressure liquid chromatography); DNA 
5 (deoxyribonucleic acid); p (plasmid); jlxI (microliters); ml (milliliters); jag (micrograms); 

pmoles (picomoles); mg (milligrams); M (molar); mM (milliMolar); \iM (microMolar); nm 
(nanometers); kdal (kilodaltons); OD (optical density); EDTA (ethylene diamine tetra-acetic 
acid); FITC (fluorescein isothiocyanate); SDS (sodium dodecyl sulfate); NaP0 4 (sodium 
phosphate); Tris (tris(hydroxymethyl)-aminomethane); PMSF (phenylmethylsulfonylfluoride); 
10 TAE (Tris-acetate-EDTA, i.e., 40mM Tris-Acetate, 1 mM EDTA); TBE (Tris-Borate-EDTA, 
i.e., Tris buffer titrated with boric acid rather than HC1 and containing EDTA); PBS 
(phosphate buffered saline); PPBS (phosphate buffered saline containing 1 mM PMSF); 
y| PAGE (polyacrylamide gel electrophoresis); Tween (polyoxyethylene-sorbitan); ATCC 
Sf I (American Type Culture Collection, Rockville, MD); DSMZ (Deutsche Sammlung von 

HS Mikroorganismen und Zellculturen, Braunschweig, Germany); Sigma (Sigma Chemical 
□ Company, St. Louis, MO); Dynal (Dynal A.S., Oslo, Norway); Gull (Gull Laboratories, Salt 

f* Lake City, UT); Epicentre (Epicentre Technologies, Madison, WI); MJ Research (MJ 

Cl Research, Watertown,MA); National Biosciences (Plymouth, MN); New England Biolabs 

m (Beverly, MA); Novagen (Novagen, Inc., Madison, WI); Perkin Elmer (Norwalk, CT); 

220 Promega Corp. (Madison, WI); Stratagene (Stratagene Cloning Systems, La Jolla, CA); 
O Clonetech (Clonetech, Palo Alto, CA); Pharmacia (Pharmacia, Piscataway, NJ); Milton Roy 
(Milton Roy, Rochester, NY); Amersham (Amersham International, Chicago, IL); USB (U.S. 
Biochemical, Cleveland, OH); and Qiagen (Valencia, CA). 

25 EXAMPLE 1 

Rapid screening of colonies for 5' nuclease activity 

The native 5' nucleases and the enzymes of the present invention can be tested directly 
for a variety of functions. These include, but are not limited to, 5' nuclease activity on RNA 
30 or DNA targets and background specificity using alternative substrates representing structures 
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that may be present in a target detection reaction. Examples of nucleic acid molecules having 
suitable test structures are shown schematically in Figures 18A-D and Figures 21-24. The 
screening techniques described below were developed to quickly and efficiently characterize 
5' nucleases and to determine whether the new 5' nucleases have any improved or desired 
activities. Enzymes that show improved cycling rates on RNA or DNA targets, or that result 
in reduced target-independent cleavage merit more thorough investigation. In general, the 
modified proteins developed by random mutagenesis were tested by rapid colony screen on 
the substrates shown in Figures 18A and 18B. A rapid protein extraction was then done, and 
a test of activity on alternative structures, (e.g., as shown in Figures 18C-D) was performed 
using the protein extract. Either the initial screen, or further screening and characterization of 
enzymes for improved activity may be performed using other cleavage complexes, such as 
those diagrammed in Figures 21-24. It is not intended that the scope of the invention be 
limited by the particular sequences used to form such test cleavage structures. One skilled in 
the art would understand how to design and create comparable nucleic acids to form 
analogous structures for rapid screening. 

This order of testing may be chosen to reduce the number of tests overall, to save time 
and reagents. The order of testing for enzyme function is not intended to be a limitation on 
the present invention. Those mutants that showed reasonable cycling rates with the RNA or 
DNA targets may then be cultured overnight, and a rapid protein extraction done. 
Alternatively, any subset or all of the cleavage tests may be done at the same time. 

For convenience, each type of rapid screen may be done on a separate microtiter plate. 
For example, one plate may be set up to test RNA INVADER activity, one plate set up to test 
for DNA INVADER activity. As many as 90 different colonies may be screened on one 
plate. The colonies screened can be from a variety of sources, such as clones of unaltered 
(native) 5' nucleases, from one mutagenesis reaction (e.g., many colonies from a single plate) 
or from a variety of reactions (colonies selected from multiple plates). 

Ideally, positive and negative controls should be run on the same plate as the mutants, 
using the same preparation of reagents. One example of a good positive control is a colony 
containing the unmodified enzyme, or a previously modified enzyme whose activity is to be 
compared to new mutants. For example, if a mutagenesis reaction is performed on the Taq 
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DN RX HT construct (described below), the unmodified Taq DN RX HT construct would be 
chosen as the standard for comparing the effects of mutagenesis on enzymatic activity. 
Additional control enzymes may also be incorporated into the rapid screening test. For 
example, Tth DN RX HT (described below; unless otherwise specified, the TaqPol and TthPol 
enzymes of the following discussion refer to the DN RX HT derivative) may also be included 
as a standard for enzymatic activity along with the Taq DN RX HT. This would allow a 
comparison of any altered enzymes to two known enzymes having different activities. A 
negative control should also be run to determine the background reaction levels (i.e., cleavage 
or probe degradation due to sources other than the nucleases being compared). A good 
negative control colony would be one containing only the vector used in the cloning and 
mutagenesis, for example, colonies containing only the pTrc99A vector. 

Two factors that may influence the number of colonies chosen from a specific 
mutagenesis reaction for the initial rapid screen are 1) total number of colonies obtained from 
the mutagenesis reaction, and 2) whether the mutagenesis reaction was site- specific or 
randomly distributed across a whole gene or a region of a gene. For example, if only 5-10 
colonies are present on the plate, all colonies can easily be tested. If hundreds of colonies are 
present, a subset of these may be analyzed. Generally 10-20 colonies are tested from a 
site-specific mutagenesis reaction, while 80 to 100 or more colonies are routinely tested from 
a single random mutagenesis reaction. 

Where indicated, the altered 5' nucleases described in these experimental examples 
were tested as detailed below. 

A. Rapid screen: INVADER activity on RNA target (Figure 18A) 

A 2X substrate mix was prepared, comprising 20 mM MOPS, pH 7.5, 10 mM MgS0 4 , 
200 mM KC1, 2 jiM FRET-probe oligo SEQ ID NO:21 
(5'-Fl-CGCT-cy3-TCTCGCTCGC-3'), 1 (iM INVADER oligo SEQ ID NO:22 
(5 ACGGAACGAGCGTCTTTG-3 '), and 4 nM RNA target SEQ ID NO:23 (5'-GCG AGC 
GAGA CAG CGA AAG ACG CUC GUU CCG U-3'). Five ^1 of the 2X substrate mix were 
dispensed into each sample well of a 96 well microtiter plate (Low Profile MULTIPLATE 96, 
MJ. Research, Inc.). 
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Cell suspensions were prepared by picking single colonies (mutants, positive control, 
and negative control colonies) and suspending each one in 20jLtl of water. This can be done 
conveniently in a 96 well microtiter plate format, using one well per colony. 

Five pi of the cell suspension was added to the appropriate test well such that the final 
5 reaction conditions were 10 mM MOPS, pH 7.5, 5 mM MgS0 4? 100 mM KC1, 1 pM 

FRET-probe oligo, 0.5 jaM INVADER oligo, and 2 nM RNA target. The wells were covered 
with 10 pi of Clear Chill-out 14 (MJ. Research, Inc.) liquid wax, and the samples were 
heated at 85°C for 3 minutes, then incubated at 59°C for 1 hour. After the incubation, the 
plates were read on a Cytofluor flourescense plate reader using the following parameters: 
10 excitation 485/20, emission 530/30. 

B. Rapid screen: INVADER activity on DNA target (Figure 18B) 

y| A 2X substrate mix was prepared, comprising 20 mM MOPS, pH 7.5, 10 mM MgS0 4 , 

^ 200 mM KC1, 2 jxM FRET-probe oligo SEQ ID NO:21 

(5 5 -Fl-CGCT-Cy3-TCTCGCTCGC-3') ? 1 pM INVADER oligo SEQ ID NO:22 

j| (5 '-ACGGAACGAGCGTCTTTG-3 ? ), 1 nM DNA target SEQ ID NO:24 (5'-GCG AGC 

* GAGA CAG CGA AAG ACG CTC GTT CCG T-3'). Five (il of the 2X substrate mix was 

u dispensed into each sample well of a 96 well microtiter plate (MJ Low Profile). 

Cell suspensions were prepared by picking single colonies (mutants, positive control 

2|0 and negative control colonies) and suspending them in 20 jal of water, generally in a 96 well 

O microtiter plate format. 

5 jliI of the cell suspension were added to the appropriate test well such that the final 
reaction conditions were 10 mM MOPS, pH 7.5, 5 mM MgS0 4 , 100 mM KC1, 1 \M 
FRET-probe oligo, 0.5 [iM INVADER oligo, and 0.5 nM DNA target. Wells were covered 

25 with 10|u 1 of Clear Chill-out 14 (MJ. Research, Inc.) liquid wax, and the reactions were 

heated at 85°C for 3 minutes, then incubated at 59°C for 1 hour. After the hour incubation, 
the plate were read on a Cytofluor flourescan plate reader using the following parameters: 
excitation 485/20, emission 530/30, gain 40, reads per well 10. 

30 
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C. Rapid protein extraction (crude cell lysate) 

Those mutants that gave a positive or an unexpected result in either the RNA or DNA 
INVADER assay were further analyzed, specifically for background activity on the 
X-structure or the hairpin substrate (Figure 18C and D, respectively). A rapid colony screen 
5 format can be employed, as described above. By simply changing the substrate, tests for 

background or aberrant enzymatic activity can be done. Another approach would be to do a 
rapid protein extraction from a small overnight culture of positive clones, and then test this 
crude cell lysate for additional protein function. One possible rapid protein extraction 
procedure is detailed below. Two to five ml of LB (containing the appropriate antibiotic for 
10 plasmid selection; See e.g., Maniatis, books 1,2 and 3) were inoculated with the remaining 

volume of the 20 \xl water-cell suspension and incubated at 37°C overnight. About 1.4 ml of 
the culture were transferred to a 1.5 ml microcentrifuge tube, and microcentrifuged at top 
yg speed (e.g., 14,000 rpm in an Eppendorf 5417 table top microcentrifuge), at room temperature 

for 3-5 minutes to pellet the cells. The supernatant was removed, and the cell pellet was 
% i5 suspended in 100 pi of TES buffer pH 7.5 (Sigma). Lysozyme (Promega) was added to a 
fj final concentration of 0.5|ig/|al and samples were incubated at room temperature for 30 
^ minutes. Samples were then heated at 70°C for 10 minutes to inactivate the lysozyme, and 
Q the cell debris was pelleted by microcentrifugation at top speed for 5 minutes. The 
fsi supernatant was removed and this crude cell lysate was used in the following enzymatic 
j|0 activity assays. 

D. Rapid screen: background specificity X structure substrate (Figure 18C) 

Reactions were performed under conditions as detailed above. One pi of crude cell 
lysate was added to 9 jxl of reaction components for a final volume of 10 jol and final 

25 concentrations of 10 mM MOPS, pH 7.5, 5 mM MgS0 4 , 100 mM KC1, 1 (aM FRET-probe 
oligo (SEQ ID NO:21), 0.5pM X-structure INVADER oligo SEQ ID N0:25 
(5 '-ACGGAACGAGCGTCTTTCATCTGTCAATC-3 and 0.5 nM DNA target (SEQ ID 
NO:24). Wells were covered with 10 \il of Clear Chill-out 14 (MJ. Research, Inc.) liquid 
wax, and the reactions were heated at 85°C for 3 minutes, then incubated at 59°C for 1 hour. 

30 After the incubation, the plates were read on a Cytofluor fluorescence plate reader using the 
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following parameters: excitation 485/20, emission 530/30, gain 40, reads per well 10. 

E. Rapid screen: background specificity hairpin substrate (Figure 18D) 

Reactions were performed under conditions as detailed above. One pi of crude cell 
5 lysate was added to 9 jal of reaction components for a final volume of 10 \il and final 

concentrations of 10 mM MOPS, pH 7.5, 5 mM MgS0 4 , 100 mM KC1, 1 \iM FRET-probe 
oligonucleotide (SEQ ID NO:21), and 0.5 nM DNA target (SEQ ID NO:24). Wells were 
covered with 10 (al of Clear Chill-out 14 (MJ. Research, Inc.) liquid wax, and the reactions 
were heated at 85°C for 3 minutes, then incubated at 59°C for 1 hour. After the hour 
10 incubation, the plate were read on a Cytofluor plate reader using the following parameters: 
excitation 485/20, emission 530/30, gain 40, reads per well 10. 

yi F. Activity assays with IrTl and IdT targets (Figures 24) 

:fj The 5' nuclease activities assays were carried out in 10 pi of a reaction containing 10 

H5 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 pg/ml tRNA, 100 mM KC1 
S and 5 mM MgS0 4 . The probe concentration (SEQ ID NO: 26) was 2 mM. The substrates 

* (IrTl (SEQ ID NO: 35) or IdT (SEQ ID NO: 36) at 10 or 1 nM final concentration 

O respectively) and approximately 20 ng of an enzyme, prepared as in Example 3, were mixed 

Si with the above reaction buffer and overlaid with Chill-out (MJ Research) liquid wax. 

220 Reactions were brought up to reaction temperature 57 °C, started by addition of MgS0 4 , and 
Q incubated for 10 min. Reactions were then stopped by the addition of 10 |ul of 95% 

formamide containing 10 mM EDTA and 0.02% methyl violet (Sigma). Samples were heated 
to 90°C for 1 minute immediately before electrophoresis through a 20% denaturing acrylamide 
gel (19:1 cross-linked), with 7 M urea, and in a buffer of 45 mM Tris-borate, pH 8.3, 1.4 mM 
25 EDTA. Unless otherwise indicated, 1 [il of each stopped reaction was loaded per lane. Gels 
were then scanned on an FMBIO-100 fluorescent gel scanner (Hitachi) using a 505 nm filter. 
The fraction of cleaved product was determined from intensities of bands corresponding to 
uncut and cut substrate with FMBIO Analysis software (version 6.0, Hitachi). The fraction of 
cleavage product did not exceed 20% to ensure that measurements approximated initial 
30 cleavage rates. The turnover rate was defined as the number of cleaved signal probes 
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generated per target molecule per minute under these reaction conditions (1/min). 

G. Activity assays with X structure (X) and hairpin (HP) targets (Figures 22) 

The 5 J nuclease activity assays were carried out in 10 jxl of a reaction containing 10 
mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 /ig/ml tRNA, 100 mM KC1 
and 5 mM MgS0 4 . Each oligo for formation of either the hairpin structure assembly (22A, 
SEQ ID NOS: 29 and 30) assembly or the X structure assembly (22B, SEQ ID NOS: 29-31) 
was added to a final concentration of 1 |im, and approximately 20 ng of test enzyme prepared 
as described in Example 3, were mixed with the above reaction buffer and overlaid with 
Chill-out (MJ Research) liquid wax. Reactions were brought up to reaction temperature 60 
°C, started by addition of MgS0 4 , and incubated for 10 min. Reactions were then stopped by 
the addition of 10 jal of 95% formamide containing 10 mM EDTA and 0.02% methyl violet 
(Sigma). Samples were heated to 90°C for 1 minute immediately before electrophoresis 
through a 20% denaturing acrylamide gel (19:1 cross-linked), with 7 M urea, and in a buffer 
of 45 mM Tris-borate, pH 8.3, 1.4 mM EDTA. Unless otherwise indicated, 1 \xl of each 
stopped reaction was loaded per lane. Gels were then scanned on an FMBIO-100 fluorescent 
gel scanner (Hitachi) using a 505 nm filter. The fraction of cleaved product was determined 
from intensities of bands corresponding to uncut and cut substrate with FMBIO Analysis 
software (version 6.0, Hitachi). The fraction of cleavage product did not exceed 20% to 
ensure that measurements approximated initial cleavage rates. The turnover rate was defined 
as the number of cleaved signal probes generated per target molecule per minute under these 
reaction conditions (1/min). 

H. Activity assays with human IL-6 target (Figure 10) 

The 5' nuclease activities assays were carried out in 10 (il reactions containing 10 
mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 jug/ml tRNA, 100 mM KC1 
and 5 mM MgS0 4 . Reactions comprising the DNA IL-6 substrate contained 0.05 nM IL-6 
DNA target (SEQ ID NO: 18) and 1 (iM of each probe (SEQ ID NO: 16) and INVADER 
(SEQ ID NO: 15) oligonucleotides, and were carried out at 60°C for 30 min. Reactions 
comprising the IL-6 RNA target (SEQ ED NO: 17) were performed under the same 
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conditions, except that the IL-6 RNA target concentration was 1 nM and the reactions were 
performed at 57°C for 60 min. Each reaction contained approximately 20 ng of test enzyme, 
prepared as described in Example 3. 

5 L Activity assays with synthetic r25mer target (Figure 23) 

Reactions comprising the synthetic r25mer target (SEQ ID NO: 34) were carried out 
under the same reaction conditions (10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet 
P-40, 10 ^ig/ml tRNA, 100 mM KC1 and 5 mM MgS0 4 ) and 1 jiM of each probe (SEQ ID 
NO: 32) and INVADER (SEQ ID NO: 33) oligonucleotides, except that the r25mer target 
10 concentration was 5 nM and the reactions were performed at 58°C for 60 min. 

Approximately 20 ng of each test enzyme was added to the reactions. Enzymes were 
prepared as described in Example 3. 

?f : Any of the tests described above can be modified to derive the optimal conditions for 

%15 enzymatic activity. For example, enzyme titrations can be done to determine the optimal 

S enzyme concentration for maximum cleavage activity, and lowest background signal. By way 

+ of example, but not by way of limitation, many of the mutant enzymes were tested at 10, 20 

Q and 40 ng amounts. Similarly, a temperature titration can also be incorporated into the tests, 

fy Since modifying the structure of a protein can alter its temperature requirements, a range of 

jgO temperatures can be tested to identify the condition best suited for the mutant in question. 
O Examples of the results from such screens (using approximately 20 ng of the mutant 

enzyme) are shown in Tables 2-7, and Figures 12, 14, 15, 19, and 25. 

EXAMPLE 2 

25 Cloning and Expression of 5 5 nucleases of DNA polymerases and mutant polymerases 

A. DNA polymerases of Thermus aquaticus and Thermus thermophilus 

1. Cloning of TaqPol and TthPol 

30 Type A DNA polymerases from eubacteria of the genus Thermus share extensive 
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protein sequence identity (90% in the polymerization domain, using the Lipman-Pearson 
method in the DNA analysis software from DNAStar, WI) and behave similarly in both 
polymerization and nuclease assays. Therefore, the genes for the DNA polymerase of 
Thermus aquaticus (TaqPol), Thermus thermophilus (TthPol) and Thermus scotoductus were 
used as representatives of this class. Polymerase genes from other eubacterial organisms, 
including, but not limited to, Escherichia coli, Streptococcus pneumoniae, Mycobacterium 
smegmatis, Thermus thermophilus, Thermus sp., Thermotoga maritima, Thermosipho 
africanus, and Bacillus stearothermophilus are equally suitable. 

a. Initial TaqPol Isolation: mutant TaqA/G 

The Taq DNA polymerase gene was amplified by polymerase chain reaction from 
genomic DNA from Thermus aquaticus, strain YT-1 (Lawyer et al, supra), using as primers 
the oligonucleotides described in SEQ ID NOS:37 and 38. The resulting fragment of DNA 
has a recognition sequence for the restriction endonuclease EcoRL at the 5' end of the coding 
sequence and a BgRl sequence at the 3' end of the coding strand. Cleavage with Bglll leaves 
a 5' overhang or "sticky end" that is compatible with the end generated by BamBl. The 
PCR-amplified DNA was digested with EcoRL and BamRI. The 2512 bp fragment containing 
the coding region for the polymerase gene was gel purified and then ligated into a plasmid 
that contains an inducible promoter. 

In one embodiment of the invention, the pTTQ18 vector, which contains the hybrid 
trp-lac (tac) promoter, was used (M.J.R. Stark, Gene 5:255 [1987]). The tac promoter is 
under the control of the E. coli lac repressor protein. Repression allows the synthesis of the 
gene product to be suppressed until the desired level of bacterial growth has been achieved, at 
which point repression is removed by addition of a specific inducer, 

isopropyl-b-D-thiogalactopyranoside (IPTG). Such a system allows the controlled expression 
of foreign proteins that may slow or prevent growth of transformants. 

Particularly strong bacterial promoters, such as the synthetic Ytac, may not be 
adequately suppressed when present on a multiple copy plasmid. If a highly toxic protein is 
placed under control of such a promoter, the small amount of expression leaking through, 
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even in the absence of an inducer, can be harmful to the bacteria. In another embodiment of 
the invention, another option for repressing synthesis of a cloned gene product is 
contemplated. A non-bacterial promoter from bacteriophage T7, found in the plasmid vector 
series pET-3, was used to express the cloned mutant Taq polymerase genes (Studier and 
Moffatt, J. Mol. Biol, 189:113 [1986]). This promoter initiates transcription only by T7 
RNA polymerase. In a suitable strain, such as BL21(DE3)pLYS, the gene for the phage T7 
RNA polymerase is carried on the bacterial genome under control of the lac operator. This 
arrangement has the advantage that expression of the multiple copy gene (on the plasmid) is 
completely dependent on the expression of T7 RNA polymerase, which is easily suppressed 
because it is present in a single copy. 

These are just two examples of vectors having suitable inducible promoters. Others 
are well known to those skilled in the art, and it is not intended that the improved nucleases 
of the present invention be limited by the choice of expression system. 

For ligation into the pTTQ18 vector, the PCR product DNA containing the Taq 
polymerase coding region (termed mutTaq for reasons discussed below, SEQ ID NO:39) was 
digested with EcoRI and Bglll and this fragment was ligated under standard "sticky end" 
conditions (Sambrook et al. Molecular Cloning, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, pp. 1 .63-1. 69 [1989]) into the EcoRI and BamEI sites of the plasmid vector 
pTTQ18. Expression of this construct yields a translational fusion product in which the first 
two residues of the native protein (Met-Arg) are replaced by three from the vector 
(Met-Asn-Ser), but the remainder of the PCR product's protein sequence is not changed (SEQ 
ID NO:40). The construct was transformed into the JM109 strain of E. coli, and the 
transformants were plated under incompletely repressing conditions that do not permit growth 
of bacteria expressing the native protein. These plating conditions allow the isolation of 
genes containing pre-existing mutations, such as those that result from the infidelity of Taq 
polymerase during the amplification process. 

Using this amplification/selection protocol, a clone was isolated containing a mutated 
Taq polymerase gene (mut7a#). The mutant was first detected by its phenotype, in which 
temperature-stable 5 J nuclease activity in a crude cell extract was normal, but polymerization 
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activity was almost absent (approximately less than 1% of wild type Taq polymerase activity). 
Polymerase activity was determined by primer extension reactions. The reactions were carried 
out in 10 (il of buffer containing 10 mM MOPS, pH 7.5, 5 mM MgS0 4 , 100 mM KCL In 
each reaction, 40 ng of enzyme were used to extend 10 |uM (dT) 25 _ 30 primer in the preesnce of 
either 10 |iM poly (A) 2g6 or 1 juM poly (dA) 273 template, 45 jaM dTTP and 5 \xM Fl-dUTP at 
60°C for 30 minutes. Reactions were stopped with 10 jal of stop solution (95% formamide, 
10 mM EDTA, 0.02% methyl violet dye). Samples (3 jul) were fractionated on a 15% 
denaturing acrylamide gel (19:1 crossed-linked) and the fraction of incorporated Fl-dUTP was 
quantitated using an FMBIO-100 fluorescence gel scanner (Hitachi) equipped with a 505 nm 
emission filter. 

DNA sequence analysis of the recombinant gene showed that it had changes in the 
polymerase domain resulting in two amino acid substitutions: an A to G change at nucleotide 
position 1394, which causes a Glu to Gly change at amino acid position 465 (numbered 
according to the natural nucleic and amino acid sequences, SEQ ID NOS:l and 4), and 
another A to G change at nucleotide position 2260, which causes a Gin to Arg change at 
amino acid position 754. Because the Gin to Gly mutation is at a nonconserved position and 
because the Glu to Arg mutation alters an amino acid that is conserved in virtually all of the 
known Type A polymerases, the latter mutation is most likely the one responsible for 
curtailing the synthesis activity of this protein. The nucleotide sequence for the construct is 
given in SEQ ID NO:39. The enzyme encoded by this sequence is referred to as Taq A/G. 

b. Initial TthPol Isolation 

The DNA polymerase enzyme from the bacterial species Thermus thermophilus (Tth) 
was produced by cloning the gene for this protein into an expression vector and 
overproducing it in E. coli cells. Genomic DNA was prepared from 1 vial of dried Thermus 
thermophilus strain HB-8 from ATCC (ATCC #27634). The DNA polymerase gene was 
amplified by PCR using the following primers: 

5 ' -C ACGAATTCCGAGGCGATGCTTCCGCTC-3 ' (SEQ ID NO:41) and 

5 ' -TCGACGTCGACT AACCCTTGGCGGAAAGCC-3 ' (SEQ ID NO:42). The resulting PCR 
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product was digested with EcoBl and Sail restriction endonucleases and inserted into 
EcoRl/Sal I digested plasmid vector pTrc99G (described in Example 2C1) to create the 
plasmid pTrcTth-L This Tth polymerase construct is missing a single nucleotide that was 
inadvertently omitted from the 5 5 oligonucleotide, resulting in the polymerase gene being out 
of frame. This mistake was corrected by site specific mutagenesis of pTrcTth-1 as described 
in Examples 4 and 5 using the following oligonucleotide: 

5 '-GCATCGCCTCGGAATTCATGGTC-3 ' (SEQ ID NO:43), to create the plasmid 
pTrcTth-2. The protein and the nucleic acid sequence encoding the protein are referred to as 
TthPol, and are listed as SEQ ID NOS:6 and 3 respectively. 

c. Large Scale preparation of recombinant proteins 

The recombinant proteins were purified by the following technique which is derived 
from a Taq DNA polymerase preparation protocol (Engelke et aL, Anal. Biochem., 191:396 
[1990]) as follows. E. coli cells (strain JM109) containing either pTrc99A TaqPol, 
pTrc99GTthPol were inoculated into 3 ml of LB containing 100 mg/ml ampicillin and grown 
for 16 hrs at 37°C. The entire overnight culture was inoculated into 200 ml or 350 ml of LB 
containing 100 mg/ml ampicillin and grown at 37°C with vigorous shaking to an A 600 of 0.8. 
IPTG (1 M stock solution) was added to a final concentration of 1 mM and growth was 
continued for 16 hrs at 37°C. 

The induced cells were pelleted and the cell pellet was weighed. An equal volume of 
2X DG buffer (100 mM Tris-HCl, pH 7.6, 0.1 mM EDTA) was added and the pellet was 
suspended by agitation. Fifty mg/ml lysozyme (Sigma) were added to 1 mg/ml final 
concentration and the cells incubated at room temperature for 15 min. Deoxycholic acid 
(10% solution) was added dropwise to a final concentration of 0.2 % while vortexing. One 
volume of H 2 0 and 1 volume of 2X DG buffer were added, and the resulting mixture was 
sonicated for 2 minutes on ice to reduce the viscosity of the mixture. After sonication, 3 M 
(NH 4 ) 2 S0 4 was added to a final concentration of 0.2 M, and the lysate was centrifuged at 
14000 x g for 20 min at 4°C. The supernatant was removed and incubated at 70°C for 60 
min at which time 10% polyethylimine (PEI) was added to 0.25%. After incubation on ice 
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for 30 min., the mixture was centrifuged at 14,000 x g for 20 min at 4°C. At this point, the 
supernatant was removed and the protein precipitated by the addition of (NH 4 ) 2 S0 4 as follows. 

Two volumes of 3 M (NH 4 ) 2 S0 4 were added to precipitate the protein. The mixture 
was incubated overnight at room temperature for 16 hrs centrifuged at 14,000 x g for 20 min 
at 4°C. The protein pellet was suspended in 0.5 ml of Q buffer (50 mM Tris-HCl, pH 8.0, 
0.1 mM EDTA, 0.1% Tween 20). For the Mja FEN-1 preparation, solid (NH 4 ) 2 S0 4 was 
added to a final concentration of 3 M (-75% saturated), the mixture was incubated on ice for 
30 min, and the protein was spun down and suspended as described above. 

The suspended protein preparations were quantitated by determination of the A 279 
dialyzed and stored in 50% glycerol, 20 mM Tris HC1, pH8.0, 50 mM KC1, 0.5% Tween 20, 
0.5% Nonidet P-40, with 100 |ug/ml BSA. 

B, DNA polymerases of Thermus filiformis and Thermus scotoductus 
L Cloning of Thermus filiformis and Thermus scotoductus 

One vial of lyophilized Thermus filiformis (Tfi) obtained from DSMZ (Deutsche 
Sammlung von Mikroorganismen und Zellculturen, Braunschweig, Germany, strain #4687) 
was rehydrated in 1 ml of Castenholz medium (DSMZ medium 86) and inoculated into 500 
ml of Castenholz medium preheated to 50°C. The culture was incubated at 70°C with 
vigorous shaking for 48 hours. After growth, the cells were harvested by centrifugation at 
8000 x g for 10 minutes, the cell pellet was suspended in 10 ml of TE (10 mM TrisHCL, pH 
8.0, 1 mM EDTA), and the cells were frozen at -20°C in 1 ml aliquots. A 1 ml aliquot was 
thawed, lysozyme was added to 1 mg/ml, and the cells were incubated at 23 °C for 30 
minutes. A solution of 20% SDS (sodium dodecyl sulfate) was added to a final concentration 
of 0.5% followed by extraction with buffered phenol. The aqueous phase was further 
extracted with 1:1 phenolxhloroform, and extracted a final time with chloroform. One-tenth 
volume of 3 M sodium acetate, pH 5.0 and 2.5 volumes of ethanol were added to the aqueous 
phase and mixed. The DNA was pelleted by centrifugation at 20,000 x g for 5 minutes. The 
DNA pellet was washed with 70% ethanol, air dried and resuspended in 200 \xl of TE and 
used directly for amplification. Thermus scotoductus (Tsc, ATCC # 51532) was grown and 
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genomic DNA was prepared as described above for Thermus filiformis. 

The DNA polymerase I gene from Tfi (GenBank accession #AF030320) could not be 
amplified as a single fragment. Therefore, it was cloned in 2 separate fragments into the 
expression vector pTrc99a, The 2 fragments overlap and share a Not I site which was created 
by introducing a silent mutation at position 1308 of the Tfi DNA polymerase open reading 
frame (ORF) in the PCR oligonucleotides. The V half of the gene was amplified using the 
Advantage cDNA PCR kit (Clonetech) with the following oligonucleotides; 
5 ' - AT AGCC ATGGTGGAGCGGCCGCTCTCCCGG (SEQ ID NO:44) and 
5 ' - AAGCGTCGACTC AATCCTGCTTCGCCTCC AGCC (SEQ ID NO:45). The PCR product 
from this reaction was approximately 1200 base pairs in length. It was cut with the restriction 
enzymes Not I and Sal I, and the resulting DNA was ligated into pTrc99a cut with NotI and 
Sail to create pTrc99a-Tfi3\ The 5' half of the gene was amplified as described above using 
the following two primers; 5 ' AATCGAATTCACCCCACTTTTTGACCTGGAGG (SEQ ID 
NO:46) and 5'-CCGGGAGAGCGGCCGCTCCAC (SEQ ID NO:47). The resulting 1300 base 
pair fragment was cut with restriction enzymes Eco RI and Not I and ligated into 
pTrc99a-Tfi3 5 cut with Notl and EcoRL to produce pTrc99a-TfiPol, SEQ ID NO:48 (the 
corresponding amino acid sequence is listed in SEQ ID NO:155). 

The DNA polymerase I gene from Thermus scotoductus was amplified using the 
Advantage cDNA PCR kit (Clonetech) using the following two primers; 
5 '-ACTGGAATTCCTGCCCCTCTTTGAGCCCAAG (SEQ ID NO:49) and 
5 ' - AACAGTCGACCT AGGCCTTGGCGGAAAGCC (SEQ ID NO:50). The PCR product 
was cut with restriction enzymes Eco RI and Sal I and ligated into Eco RI, Sal I cut pTrc99a 
to create pTrc99a-TscPol SEQ ID NO:51 (the corresponding amino acid sequence is listed in 
SEQ ID NO:268). 

2. Expression and purification of Thermus filiformis and Thermus scotoductus 

Plasmids were transformed into protease deficient E. coli strain BL21 (Novagen) or 
strain JM109 (Promega Corp., Madison, WI) for protein expression. Flasks containing 200 ml 
of LB containing 100 ju,g/ml ampicillin were inoculated with either a single colony from an 
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LB plate or from a frozen stock of the appropriate strain. After several hours of growth at 
37°C with vigorous shaking, cultures was induced by the addition of 200 ul of 1 M 
isothiopropyl-galatoside (IPTG). Growth at 37°C was continued for 16 hours prior to harvest. 
Cells were pelleted by centrifugation at 8000 x g for 15 minutes followed by suspension of 
the cell pellet in 5 ml of TEN (10 mM TrisHCl, pH 8.0, 1 raM EDTA, 100 mM NaCl). 100 
ul of 50 mg/ml lysozyme were added and the cells incubated at room temperature for 15 
minutes. Deoxycholic acid (10%) was added to a final concentration of 0.2%. After 
thorough mixing, the cell lysates were sonicated for 2 minutes on ice to reduce the viscosity 
of the mixture. Cellular debris was pelleted by centrifugation at 4°C for 15 minutes at 20,000 
x g. The supernatant was removed and incubated at 70°C for 30 min after which 10% 
polyethylimine (PEI) was added to 0.25%. After incubation on ice for 30 minutes, the 
mixture was centrifuged at 20,000 x g for 20 min at 4°C. At this point, the supernatant 
containing the enzyme was removed, and the protein was precipitated by the addition of 1.2 g 
of ammonium sulfate and incubation at 4°C for 1 hour. The protein was pelleted by 
centrifugation at 4°C for 10 minutes at 20,000 x g. The pellet was resuspended in 4 ml of 
HPLC Buffer A (50 mM TrisHCl, pH 8.0, 1 mM EDTA). The protein was further purified 
by affinity chromatography using an Econo-Pac heparin cartridge (Bio-Rad) and a Dionex DX 
500 HPLC instrument. Briefly, the cartridge was equilibrated with HPLC Buffer A, and the 
enzyme extract was loaded on the column and eluted with a linear gradient of NaCl (0-2 M) 
in the same buffer. Pure protein elutes between 0.5 and 1 M NaCl. The enzyme peak was 
collected and dialyzed in 50% glycerol, 20 mM Tris HC1, pH 8, 50 mM KC1, 0.5% Tween 
20, 0.5% Nonidet P40, 100 mg/ml BSA. 

C. Generation of polymerase mutants with reduced polymerase activity but 
unaltered 5' nuclease activity 

All mutants generated in section C were expressed and purified as described in 
Example 2A1C. 
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1. Modified TaqPol Genes: TaqDN 

A polymerization deficient mutant of Taq DNA polymerase called TaqDN was 
constructed. TaqDN nuclease contains an asparagine residue in place of the wild-type aspartic 
acid residue at position 785 (D785N). 

DNA encoding the TaqDN nuclease was constructed from the gene encoding the Taq 
A/G in two rounds of site-directed mutagenesis. First, the G at position 1397 and the G at 
position 2264 of the Taq A/G gene (SEQ ID NO:39) were changed to A at each position to 
recreate a wild-type TaqPol gene. In a second round of mutagenesis, the wild type TaqPol 
gene was converted to the Taq DN gene by changing the G at position 2356 to A. These 
manipulations were performed as follows. 

DNA encoding the Taq A/G nuclease was recloned from pTTQ18 plasmid into the 
pTrc99A plasmid (Pharmacia) in a two step procedure. First, the pTrc99A vector was 
modified by removing the G at position 270 of the pTrc99A map, creating the pTrc99G 
cloning vector. To this end, pTrc99A plasmid DNA was cut with Ncol and the recessive 3' 
ends were filled-in using the Klenow fragment of E.coli polymerase I in the presence of all 
four dNTPs at 37°C for 15 min. After inactivation of the Klenow fragment by incubation at 
65°C for 10 min, the plasmid DNA was cut with EcoKL and the ends were again filled-in 
using the Klenow fragment in the presence of all four dNTPs at 37° C for 15 min. The 
Klenow fragment was then inactivated by incubation at 65°C for 10 min. The plasmid DNA 
was ethanol precipitated, recircularized by ligation, and used to transform E.coli JM109 cells 
(Promega). Plasmid DNA was isolated from single colonies, and deletion of the G at position 
270 of the pTrc99A map was confirmed by DNA sequencing. 

In a second step, DNA encoding the Taq A/G nuclease was removed from the 
pTTQ18 plasmid using EcoBl and SaK and the DNA fragment carrying the Taq A/G nuclease 
gene was separated on a 1% agarose gel and isolated with Geneclean II Kit (Bio 101, Vista, 
CA). The purified fragment was ligated into the pTrc99G vector which had been cut with 
EcoRl and Sail. The ligation mixture was used to transform competent E.coli JM109 cells 
(Promega). Plasmid DNA was isolated from single colonies and insertion of the Taq A/G 
nuclease gene was confirmed by restriction analysis using EcoBl and Sail. 
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Plasmid DNA pTrcAG carrying the Taq A/G nuclease gene cloned into the pTrc99A 
vector was purified from 200 ml of JM109 overnight culture using QIAGEN Plasmid Maxi 
kit (QIAGEN, Chatsworth, CA) according to manufacturer's protocol. pTrcAG plasmid DNA 
was mutagenized using two mutagenic primers, E465 (SEQ ID NO:52) (Integrated DNA 
5 Technologies, Iowa) and R754Q (SEQ ID NO:53) (Integrated DNA Technologies), and the 
selection primer Trans Oligonucleotide AlwNI/Spel (Clontech, Palo Alto, CA ? catalog 
#6488-1) according to TRANSFORMER Site-Directed Mutagenesis Kit protocol (Clontech, 
Palo Alto, CA) to produce a restored wild-type TaqPol gene (pTrcWT). 

pTrcWT plasmid DNA carrying the wild-type TaqPol gene cloned into the pTrc99A 
10 vector was purified from 200 ml of JM109 overnight culture using QIAGEN Plasmid Maxi 

kit (QIAGEN, Chatsworth, CA) according to manufacturer's protocol pTrcWT was then 
n mutagenized using the mutagenic primer D785N (SEQ ID NO:54) (Integrated DNA 

Technologies) and the selection primer Switch Oligonucleotide Spel/AlwNI (Clontech, Palo 
\: Alto, CA, catalog #6373-1) according to TRANSFORMER Site-Directed Mutagenesis Kit 
i '15 protocol (Clontech, Palo Alto, CA) to create a plasmid containing DNA encoding the Taq DN 
y nuclease. The DNA sequence encoding the Taq DN nuclease is provided in SEQ ID NO:55; 
5 the amino acid sequence of Taq DN nuclease is provided in SEQ ID NO:56. 

W 2. Modified TthPol Gene: Tth DN 

H20 The Tth DN construct was created by mutating the TthPol described above. The 

sequence encoding an aspartic acid at position 787 was changed by site-specific mutagenesis 
as described above to a sequence encoding asparagine. Mutagenesis of pTrcTth-2 with the 
following oligonucleotide: 5 ' -CAGGAGGAGCTCGTTGTGGACCTGGA-3 5 (SEQ ID NO:57) 
was performed to create the plasmid pTrcTthDN. The mutant protein and protein coding 
25 nucleic acid sequence is termed TthDN SEQ ID NOS:58 and 59 respectively. 

3, Taq DN HT and Tth DN HT 

Six amino acid histidine tags (his-tags) were added onto the carboxy termini of Taq 
DN and Tth DN. The site-directed mutagenesis was performed using the TRANSFORMER 
30 Site Directed Mutagenesis Kit (Clontech) according to the manufacturer's instructions. The 
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mutagenic oligonucleotides used on the plasmids pTaq DN and pTth DN were sequence 
117-067-03, 

5 ' -TCTAGAGGATCT ATCAGTGGTGGTGGTGGTGGTGCTCCTTGGCGGAGAGC-3 ' 
(SEQ ID NO:60) and 

5 ' -TGCCTGC AGGTCGACGCT AGCT AGTGGTGGTGGTGGTGGTGACCCTTGGCGGAAA 
GCC-3' (SEQ ID NO:61), sequence 136-037-05. The selection primer Trans Oligo 
AlwNI/Spel (Clontech, catalog # 6488-1) was used for both mutagenesis reactions. The 
resulting mutant genes were termed Taq DN HT (SEQ ID NO:62, nucleic acid sequence; SEQ 
ID NO:63, amino acid sequence) and Tth DN HT (SEQ ID NO:64, nucleic acid sequence; 
SEQ ID NO:65, amino acid sequence). 

4. Purification of Taq DN HT and Tth DN HT 

Both Taq DN HT and Tth DN HT proteins were expressed in E. coli strain JM109 as 
described in Example 2B2. After ammonium sulfate precipitation and centrifugation, the 
protein pellet was suspended in 0.5 ml of Q buffer (50 mM Tris-HCl, pH 8.0, 0.1 mM 
EDTAm 0.1% Tween 20). The protein was further purified by affinity chromatography using 
His-Bind Resin and Buffer Kit (Novagen) according to the manufacturer's instructions. 1 ml 
of His-Bind resin was transferred into a column, washed with 3 column volumes of sterile 
water, charged with 5 volumes of IX Charge Buffer, and equilibrated with 3 volumes of IX 
Binding Buffer. Four ml of IX Binding Buffer was added to the protein sample and the 
sample solution was loaded onto the column. After washing with 3 ml of IX Binding Buffer 
and 3 ml of IX Wash Buffer, the bound His-Tag protein was eluted with 1 ml of IX Elute 
Buffer. The pure enzyme was then dialyzed in 50% glycerol, 20 mM Tris-HCl, pH 8.0, 50 
mM KC1, 0.5% Tween 20, 0.5% Nonidet P40, and 100 |ag.ml BSA. Enzyme concentrations 
were determined by measuring absorption at 279 mn. 
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EXAMPLE 3 

RNA-dependent 5' nuclease activity of TthPol can be conferred on TaqPol by transfer of 
the N-terminal portion of the DNA polymerase domain 

5 A. Preparation and purification of substrate structures having either a DNA or an 
RNA target strand 

The downstream (SEQ ID NO:16) and upstream probes (SEQ ID NO: 15) and the IL-6 
DNA (SEQ ID NO: 18) (figure 10) target strand were synthesized on a PerSeptive Biosystems 
instrument using standard phosphoramidite chemistry (Glen Research). The synthetic 
10 RNA-DNA chimeric IrT target labeled with biotin at the 5 '-end (figure 20 A) was synthesized 
utilizing 2'-ACE RNA chemistry (Dharmacon Research). The 2'-protecting groups were 
removed by acid-catalyzed hydrolysis according to the manufacturer's instructions. The 
J downstream probes labeled with 5 '-fluorescein (Fl) or 5'-tetrachloro-fluorescein (TET) at their 
if! 5' ends were purified by reverse phase HPLC using a Resource Q column 
''15 (Amersham-Pharmacia Biotech). The 648-nucleotide IL-6 RNA target (SEQ ID NO: 17) 
K (figure 10) was synthesized by T7 RNA polymerase runoff-transcription of the cloned 
* fragment of human IL-6 cDNA (nucleotides 64-691 of the sequence published in May et al, 
Q Proc. Natl. Acad. Sci., 83:8957 [1986]) using a Megascript Kit (Ambion). All 
ni oligonucleotides were finally purified by separation on a 20% denaturing polyacrylamide gel 

:§0 followed by excision and elution of the major band. Oligonucleotide concentration was 
O determined by measuring absorption at 260 nm. The biotin labeled IrT target was incubated 
with a 5-fold excess of streptavidin (Promega) in a buffer containing 10 mM MOPS, pH 7.5, 
0.05% Tween 20, 0.05% NP-40 and 10 ng/ml tRNA at room temperature for 10 min. 

25 B. Introduction of restriction sites to make chimeras 

The restriction sites used for formation of chimerical proteins, described below, were 
chosen for convenience. The restriction sites in the following example have been strategically 
placed to surround regions shown by crystal structure and other analysis to be functional 
domains (See, Figures 6, 7, and 19). Different sites, either naturally occurring or created via 
30 directed mutagenesis can be used to make similar constructs with other Type A polymerase 
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genes from related organisms. It is desirable that the mutations all be silent with respect to 
protein function. By studying the nucleic acid sequence and the amino acid sequence of the 
protein, one can introduce changes in the nucleic acid sequence that have no effect on the 
corresponding amino acid sequence. If the nucleic acid change required affects an amino 
acid, one can make the alteration such that the new amino acid has the same or similar 
characteristics of the one replaced. If neither of these options is possible, one can test the 
mutant enzyme for function to determine if the nucleic acid alteration caused a change in 
protein activity, specificity or function. It is not intended that the invention be limited by the 
particular restriction sites selected or introduced for the creation of the improved enzymes of 
the present invention. 

C. Generation of Tth DN RX HT and Taq DN RX HT 

Mutagenesis was performed to introduce 3 additional, unique restriction sites into the 
polymerase domain of both the Taq DN HT and Tth DN HT enzymes. Site specific 
mutagenesis was performed using the Transformer Site-Directed Mutagenesis Kit from 
(Clonetech) according to manufacturer's instructions. One of two different selection primers, 
Trans Oligo AlwNI/Spel or Switch Oligo Spel/AlwNI (Clontech, Palo Alto CA catalog 
#6488-1 or catalog #6373-1) was used for all mutagenesis reactions described. The selection 
oligo used in a given reaction is dependent on the selection restriction site present in the 
vector. All mutagenic primers were synthesized by standard synthetic chemistry. Resultant 
colonies were expressed in E.coli strain JM109. 

The Not I sites (amino acid position 328) were created using the mutagenic primers 
5'-gccgccaggggcggccgcgtccaccgggcc (SEQ ID NO:66) and 

5'-gcctgcaggggcggccgcgtgcaccggggca (SEQ ID NO:67) corresponding to the sense strands of 
the Taq DN HT and the Tth DN HT genes, respectively. The BstI (amino acid position 382) 
and Ndel (amino acid position 443) sites were introduced into both genes using sense strand 
mutagenic primes 5'-ctcctggacccttcgaacaccacccc (SEQ ID NO:68) and 
5'-gtcctggcccatatggaggccac (SEQ ID NO:69). The mutant plasmids were over-expressed and 
purified using Qiagen QiaPrep Spin Mini Prep Kit (cat. # 27106). The vectors were tested for 
the presence of the restriction sites by DNA sequencing and restriction mapping. These 
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constructs are termed Tth DN RX HT (DNA sequence SEQ ID NO: 70; amino acid sequence 
SEQ ID NO:72) and Taq DN RX HT (DNA sequence SEQ ID NO:71; amino acid sequence 
SEQ ID NO:73). 



5 D. Chimeras 

The chimeric constructs shown in Figure 19 were created by exchanging homologous 
DNA fragments defined by the restriction endonuclease sites EcoRI (E) and BamHI (B), 
common for both genes, the cloning vector site Sail (S) and the new sites, NotI (N), BstBI 
(Bs), Ndel (D) created at the homologous positions of both genes by site directed 
1 0 mutagenesis. In generating these chimeric enzymes, two different pieces of DNA are ligated 
together to yield the final construct. The larger piece of DNA that contains the plasmid 
vector as well as part of the Taq or Tth (or parts of both) sequence will be termed the 
% "vector." The smaller piece of DNA that contains sequences of either the Taq or Tth (or 
W parts of both) polymerase will be termed the "insert." 

All restriction enzymes were from New England Biolabs or Promega and used in 
S reactions with the accompanying buffer, according to the manufacturer's instructions. 
=tf Reactions were done in 20 (il volume with about 500 ng of DNA per reaction, at the optimal 
p temperature for the specified enzyme. More than one enzyme was used in a single reaction 
2! (double digest) if the enzymes were compatible with respect to reaction buffer conditions and 
=|l0 reaction temperature. If the enzymes in question were not compatible with respect to buffer 
S conditions, the enzyme requiring the lowest salt condition was used first. After the 

completion of that reaction, buffer conditions were changed to be optimal or better suited to 
the second enzyme, and the second reaction was performed. These are common restriction 
enzyme digest strategies, well known to those in the art of basic molecular biology (Maniatis, 
25 supra). 

The digested restriction fragments were gel isolated for optimal ligation efficiency. 
Two jliI of 10X loading dye (50% glycerol, IX TAE, 0.5% bromophenol blue) were added to 
the 20 \i\ reaction. The entire volume was loaded and run on a 1%, IX TAE agarose gel 
containing 1 jal of a 1% ethidium bromide solution per 100 ml of agarose gel solution. The 
30 digested fragments were visualized under UV light, and the appropriate fragments (as 
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determined by size) were excised from the gel These fragments were then purified using the 
Qiagen Gel Extractio Kit, (cat # 28706) according to the manufacturer's instructions. 

Ligations were performed in a 10 |il volume, using 400 units per reaction of T4 DNA 
Ligase enzyme from New England Biolabs (catalog #202L), with the accompanying reaction 
buffer. Ligation reactions were done at room temperature for 1 hour, with 1 |j,l of each of the 
Qiagen-purified fragments (approximately 20-50 ng of each DNA, depending on recovery 
from the gel isolation). Ligation products were then transformed into E. coli strain JM 109 
and plated onto an appropriate growth and selection medium, such as LB with 100jj,g/ml of 
ampicillin to select for transformants. 

For each ligation reaction, six transformants were tested to determine if the desired 
construct was present. Plasmid DNA was purified and isolated using the QiaPrep Spin Mini 
Prep Kit, according to manufacturer's instructions. The constructs were verified by DNA 
sequencing and by restriction mapping. 

Expression and purification of the chimeric enzymes was done as follows. Plasmids 
were transformed into E. coli strain JM109 (Promega). Log phase cultures (200 ml) of 
JM109 were induced with 0.5 mM IPTG (Promega) and grown for an additional 16 hours 
prior to harvest. Crude extracts containing soluble proteins were prepared by lysis of pelleted 
cells in 5 ml of 10 mM Tris-HCl, pH 8.3, ImM EDTA, 0.5mg/ml lysozyme during incubation 
at room temperature for 15 minutes. The lysate was mixed with 5 ml of 10 mM Tris-HCl pH 
7.8, 50 mM KC1, 1 mM EDTA, 0.5% Tween 20, 0.5% Nonidet P-40, heated at 72°C for 30 
minutes, and cell debris was removed by centrifugation at 12,000x g for 5 minutes. Final 
purification of the protein was done by affinity chromatograpy using an Econo-Pac heparin 
cartridge (Bio-Rad) and Dionex DX 500 HPLC instrument. Briefly, the cartridge was 
equilibrated with 50mM Tris-HCl pH 8, 1 mM EDTA, and an enzyme extract dialyzed 
against the same buffer was loaded on the column and eluted with a linear gradient of NaCl 
(0-2 M) in the same buffer. The HPLC-purified protein was dialyzed and stored in 50% 
(vol/vol) glycerol, 20 mM Tris-HCl pH 8.0, 50 mM KC1, 0.5% Tween 20, 0.5% Nonidet 
P-40, and 100jj,g/m BSA. The enzymes were purified to homogeneity according to 
SDS-PAGE, and the enzyme concentrations were determined by measuring absorption at 279 
nm. 
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1. Construction of TaqTth(N) and TthTaq(N) 

The first exchange that was performed involved the polymerase domains of the two 
enzymes. Separation of the nuclease domain (the N-terminal end of the protein) from the 
polymerase domain (the C-terminal portion of the protein) was accomplished by cutting both 
genes with the restriction endonucleases EcoRI and Not! The approximately 900 base pair 
fragment from the Tth DN RX HT gene was cloned into the homologous sites of the Taq DN 
RX HT gene, and the approximately 900 base pair fragment from the Taq DN RX HT gene 
was cloned into the homologous sites of the Tth DN RX HT gene, yielding two chimeras, 
TaqTth(N) (DNA sequence SEQ ID NO:74; amino acid sequence SEQ ID NO:75) which has 
the Taq DN RX HT 5' nuclease domain and the Tth DN RX HT polymerase domain, and 
TthTaq(N) (DNA sequence SEQ ID NO:76; amino acid sequence SEQ ID NO:77) which is 
made up of the Tth DN RX HT 5' nuclease domain and the Taq DN RX HT polymerase 
domain. 

2. Construction of TaqTth(N-B) 

The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and the 
larger, vector fragment was gel isolated as detailed above. The Tth DN RX HT construct was 
also cut with Ndel and BamHI and the smaller (approximately 795 base pairs) Tth fragment 
was gel isolated and purified. The Tth Ndel-BamHI insert was ligated into the Taq 
Ndel-BamHI vector as detailed above to generate the TaqTth(N-B) (DNA sequence SEQ ID 
NO:78; amino acid sequence SEQ ID NO:79). 

3. Construction of TaqTth(B-S) 

The Taq DN RX HT construct was cut with the enzymes BamHI and Sail and the 
larger vector fragment was gel isolated as detailed above. The Tth DN RX HT construct was 
also cut with BamHI and Sail and the smaller (approximately 741 base pairs) Tth fragment 
was gel isolated and purified. The Tth BamHI-Sall insert was ligated into the Taq 
BamHI-Sall vector as detailed above to generate the TaqTth(B-S) (DNA sequence SEQ ID 
NO:80; amino acid sequence SEQ ID NO:81). 
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4. Construction of TaqTth(N-D) 

The Taq DN RX HT construct was cut with the enzymes NotI and Ndel and the larger 
vector fragment was isolated as detailed above. The Tth DN RX HT construct was also cut 
with NotI and Ndel and the smaller (approximately 345 base pairs) Tth fragment was gel 
isolated and purified. The Tth Notl-Ndel insert was ligated into the Taq Notl-Ndel vector as 
detailed above to generate the TaqTth(N-D) (DNA sequence SEQ ID NO:82; amino acid 
sequence SEQ ID NO:83). 

5. Construction of TaqTth(D-B) 

The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT construct was also 
cut with Ndel and BamHI and the smaller (approximately 450 base pairs) Tth fragment was 
gel isolated and purified. The Tth Ndel-BamHI insert was ligated into the Taq Ndel-BamHI 
vector as detailed above to generate the TaqTth(D-B) (DNA sequence SEQ ID NO:84; amino 
acid sequence SEQ ID NO:85). 

6. Construction of TaqTth(Bs-B) 

The Taq DN RX HT construct was cut with the enzymes BstBI and BamHI and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT construct was also 
cut with BstBI and BamHI and the smaller (approximately 633 base pairs) Tth fragment was 
gel isolated and purified. The Tth Ndel-BamHI insert was ligated into the Taq Ndel-BamHI 
vector as detailed above to generate TaqTth(Bs-B) (DNA sequence SEQ ID NO:86; amino 
acid sequence SEQ ID NO:87). 

7. Construction of TaqTth(N-Bs) 

The Taq DN RX HT construct was cut with the enzymes NotI and BstBI and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT construct was also 
cut with NotI and BstBI and the smaller (approximately 162 base pairs) Tth fragment was gel 
isolated and purified. The Tth Notl-BstBI insert was ligated into the Taq Notl-BstBI vector 
as detailed above to generate TaqTth(N-Bs) (DNA sequence SEQ ID NO: 88; amino acid 
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sequence SEQ ID NO:89). 



8. Construction of TthTaq(B-S) 

The Tth DN RX HT construct was cut with the enzymes BamHI and Sail and the 
larger vector fragment was isolated as detailed above. The Taq DN RX HT construct was 
also cut with BamHI and Sail and the smaller (approximately 741 base pairs) Tth fragment 
was gel isolated and purified. The Taq BamHI-Sall insert was ligated into the Tth 
BamHI-Sall vector as detailed above to generate the TthTaq(B-S) (DNA sequence SEQ ID 
NO:90; amino acid sequence SEQ ID NO:91). 

9. Construction of Tth Taq(N-B) 

The Tth DN RX HT construct was cut with the enzymes NotI and BamHI and the 
larger vector fragment was isolated as detailed above. The Taq DN RX HT construct was 
also cut with NotI and BamHI and the smaller (approximately 795 base pairs) Tth fragment 
was gel isolated and purified. The Taq Notl-BamHI insert was ligated into the Tth 
Notl-BamHI vector as detailed above to generate the TthTaq(N-B) (DNA sequence SEQ ID 
NO:92; amino acid sequence SEQ ID NO:93). 

The cleavage activities of these chimerical proteins were characterized as describe in 
Example 1, part A, and a comparison of the cleavage cycling rates on an RNA target is 
shown in Figure 12. As further discussed in the Description of the Invention, these data show 
that elements found in the central third of the TthPol protein are important in conferring the 
TthPol-like RNA-dependent cleavage activity on the chimerical proteins comprising portions 
of TaqPol. 

EXAMPLE 4 

Alterations influencing RNA-dependent 5' nuclease activity do not necessarily influence 

RNA-dependent DNA polymerase activity 

TthPol is known to have a more active RNA template dependent DNA polymerase 
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than does the TaqPol (Myers and Gelfand, Biochemistry 30:7661 [1991]). To determine 
whether the RNA template dependent 5' nuclease activity of the Thermus DNA Pol I enzymes 
is related to their RNA-dependent polymerase activity, the D785N and D787N mutations used 
to create the polymerase-deficient versions of TaqPol and TthPol, respectively were reversed. 
Polymerase activity was similarly restored to the TaqTth (N) (DNA sequence SEQ ID NO:94; 
amino acid sequence SEQ ID NO:95), TaqTth(N-B) (DNA sequence SEQ ID NO:96; amino 
acid sequence SEQ ID NO:97), TaqTth(B-S) (DNA sequence SEQ ID NO:98; amino acid 
sequence SEQ ID NO:99) chimeras, and the TaqPol(W417L/G418K/ E507Q) (DNA sequence 
SEQ ID NO:100; amino acid sequence SEQ ID NO:101) mutant proteins. 

Polymerase function was restored in all the above mentioned enzyme mutants by 
inserting the BamHI to Sail fragment of the native, non-DN sequence into the selected 
chimera or mutant enzyme. For example, the mutant construct TaqTth(N-B) was cut with the 
restriction enzyme BamHI (approximate amino acid position 593) and the restriction enzyme 
Sail (approximate amino acid position 840). The larger vector fragment was gel purified as 
described in Example 3D. The native TaqPol construct was also cut with the restriction 
endonucleases BamHI and Sail, and the smaller insert fragment containing the native amino 
acid sequence was also gel purified. The insert fragment was then ligated into the vector as 
detailed in Experimental Example 3D. 

The polymerase activities of these proteins were evaluated by extension of the 
dT 25 _3 5 -oligonucleotide primer with fluorescein-labeled dUTP in the presence of either 
poly(dA) or poly(A) template. Primer extension reactions were carried out in 10 ul buffer 
containing 10 mM MOPS, pH7.5, 5 mM MgS0 4 , 100 mM KC1. Forty ng of enzyme were 
used to extend 10 uM (dT) 25 . 30 primer in the presence of either 10 uM poly(A) 286 or 1 uM 
poly(dA) 273 template, 45 uM dTTP and 5 uM Fl-dUTP at 60°C for 30 min. Reactions were 
stopped with 10 ul of stop solution (95% formamide, 10 mM EDTA, 0.02% methyl violet 
dye). Samples (3 ul) were fractionated on a 15% denaturing acrylamide gel and the fraction 
of incorporated Fl-dUTP was quantitated using an FMBIO-100 fluorescent gel scanner 
(Hitachi) equipped with a 505 nm filter as described above. 

As shown in Figure 16, the DNA-dependent polymerase activities are very similar for 
all constructs used in this experiment, whereas the RNA-dependent polymerase activities of 
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TthPol, TaqTth(N) and TaqTth(B-S) are at least 6-fold higher than the activities of TaqPol, 
TaqTth(N-B) and the TaqPol W417L/G418K/E507Q mutant. From the analysis of these 
results, it can be concluded that the high RNA-dependent DNA polymerase activity of TthPol 
is determined by the C-terminal half of the polymerase domain (roughly, amino acids 
5 593-830) and that the RNA-dependent 5' nuclease and polymerase activities are not related to 
each other, and are controlled by different regions. 

EXAMPLE 5 

Specific point mutants in Taq DN RX HT developed from information from the 
10 chimeric studies 



The chimeric studies (Example 3, above) suggest that the part of the TthPol sequence 
3 determining its high RNA-dependent 5' nuclease activity comprises the BstBI-BamHI region 

located approximately between amino acid 382 and 593. Comparison of the amino acid 
H5 sequences between the BstBI and BamHI regions of Tth DN RX HT and Taq DN RX HT 
5 (SEQ ID NOS:20 and 19, respectively) revealed only 25 differences (Figure 13). Among 

s ^ these, 12 amino acid changes were conservative while 13 of the differences resulted in a 

O changes in charge. Since the analysis of the chimeric enzymes suggested that the critical 

? y mutations are located in both the BstBI-Ndel and the Ndel-BamHI regions of Tth DN RX 

j^O HT, site specific mutagenesis was used to introduce the Tth DN RX HT specific amino acids 
O into the BstBI-Ndel and Ndel-BamHI regions of the TaqTth(D-B) and the TaqTth(N-D) 

respectively. 

Six Tth DN RX HT specific substitutions were generated in the BstBI-Ndel region of 
the TaqTth(D-B) by single or double amino acid mutagenesis. Similarly, 12 Tth DN RX HT 
25 specific amino acid changes were introduced at the homologous position of the Ndel-BamHI 
region of the TaqTth(N-D). 

Plasmid DNA was purified from 200 ml of JM109 overnight culture using QIAGEN 
Plasmid Maxi Kit (QIAGEN, Chatsworth, CA) according to the manufacturer's protocol to 
obtain enough starting material for all mutagenesis reactions. All site specific mutations were 
30 introduced using the Transformer Site Directed mutagenesis Kit (Clontech) according to the 
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manufacturer's protocol; specific sequence information for the mutagenic primers used for 
each site is provided below. One of two different selection primers, Trans Oligo AlwNI/Spel 
or Switch Oligo Spel/AlwNI (Clontech, Palo Alto, CA catalog #6488-1 or catalog #6373-1) 
was used for all mutagenesis reactions described. The selection oligo used in a given reaction 
is dependent on the restriction site present in the vector. All mutagenic primers were 
synthesized by standard synthetic chemistry. Resultant colonies were E.coli strain JM109. 

1. Construction of TaqTth(D-B) E404H (DNA sequence SEQ ID NO:102; 
amino acid sequence SEQ ID NO:103) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using the 
mutagenic primer 240-60-01 5'-gag gag gcg ggg cac egg gec gec ctt-3' (SEQ ED NO:104) to 
introduce the E404H mutation. 

2. Construction of TaqTth(D-B) F413H/A414R (DNA sequence SEQ ID 
NO:105; amino acid sequence SEQ ID NO:106) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using the 
mutagenic primer 240-60-02 5'-ctt tec gag agg etc cat egg aac ctg tgg ggg agg-3' (SEQ ID 
NO:107) to introduce the F413H and the A414R mutations. 

3. Construction of TaqTth(D-B) W417L/G418K (DNA sequence SEQ ID 
NO:108; amino acid sequence SEQ ID NO:109) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using the 
mutagenic primer 240-60-03 5 '-etc ttc gec aac ctg ctt aag agg ctt gag ggg gag-3' (SEQ ID 
NO: 110) to introduce the W417L and the G418K mutations. 

4. Construction of TaqTth(D-B) A439R (DNA sequence SEQ ID NOrlll; 
amino acid sequence SEQ ID NO:112) 

Site specific mutagenesis was performed on pTrc99A TaqTth(ND-B) DNA using the 
mutagenic primer 240-60-04 5'-agg ccc ctt tec egg gtc ctg gec cat-3' (SEQ ID NO:113) to 
introduce the A439R mutation. 
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5. Construction of TaqTth(N-D) L451R (DNA sequence SEQ ID NO:114; 
amino acid sequence SEQ ID NO:115) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-05 5'-acg ggg gtg cgc egg gac gtg gec tat-3' (SEQ ID NO:116) to 
introduce the L415 mutation. 

6. Construction of TaqTth(N-D) R457Q (DNA sequence SEQ ID NO:117; 
amino acid sequence SEQ ID NO:118) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-06 5 '-gtg gec tat etc cag gec ttg tec ctg-3' (SEQ ID NO: 119) to 
introduce the L415Q mutation. 

7. Construction of TaqTth(N-D) V463L (DNA sequence SEQ ID NO:120; 
amino acid sequence SEQ ID NO: 121) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-07 5 '-ttg tec ctg gag ctt gec gag gag atc-3' (SEQ ID NO: 122) to 
introduce the V463L mutation. 

8. Construction of TaqTth(N-D) A468R (DNA sequence SEQ ID NO:123; 
amino acid sequence SEQ ID NO: 124) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-08 5'-gcc gag gag ate cgc cgc etc gag gcc-3' (SEQ ID NO:125) to 
introduce the A468R mutation. 

9. Construction of TaqTth(N-D) A472E (DNA sequence SEQ ID NO:126; 
amino acid sequence SEQ ID NO: 127) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-09 5 '-gec cgc etc gag gag gag gtc ttc cgc-3' (SEQ ID NO: 128) to 
introduce the A472E mutation. 
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10. Construction of TaqTth(N-D) G499R (DNA sequence SEQ ID NO:129; 
amino acid sequence SEQ ID NO:130) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
5 mutagenic primer 240-60-10 5 ? -ttt gac gag eta agg ctt ccc gec atc-3' (SEQ ID NO:131) to 
introduce the G499R mutation. 

11. Construction of TaqTth(N-D) E507Q (DNA sequence SEQ ID NO:132; 
amino acid sequence SEQ ID NO: 133) 

10 Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 

mutagenic primer 276-046-04 5'-atc gec aag acg caa aag acc ggc aag-3' (SEQ ID NO:134) to 
introduce the E507Q mutation. 

g 12. Construction of TaqTth(N-D) Y535H (DNA sequence SEQ ID NO:135; 

45 amino acid sequence SEQ ID NO:136) 

S Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 

^ mutagenic primer 240-60-11 5 '-aag ate ctg cag cac egg gag etc acc-3' (SEQ ID NO: 137) to 

O introduce the Y535H mutation. 

jlo 13. Construction of TaqTth(N-D) S543N (DNA sequence SEQ ID NO:138; 

Q amino acid sequence SEQ ID NO:139) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-12 5'-acc aag ctg aag aac acc tac att gac-3' (SEQ ID NO:140) to 
introduce the S543N mutation. 

25 

14, Construction of TaqTth(N-D) I546V (DNA sequence SEQ ID NO:141; 
amino acid sequence SEQ ID NO: 142) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-13 5 J -aag age acc tac gtg gac ccc ttg ccg-3' (SEQ ID NO:143) to 
30 introduce the I546V mutation. 
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15. Construction of TaqTth(N-D) D551S/I553V (DNA sequence SEQ ID 
NO:144; amino acid sequence SEQ ID NO:145) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-14 5'-att gac ccc ttg ccg age etc gtc cac ccc agg acg ggc-3' (SEQ 
ID NO: 146) to introduce the D551S and the I553V mutations. 

16. Construction of TaqDN RX HT W417L/G418K/E507Q (DNA sequence 
SEQ ID NO:147; amino acid sequence SEQ ID NO:148) 

The TaqDN RX HT W417L/G418K/E507Q triple mutant was made by combining the 
TaqTth(D-B)W417L/G418K with the TaqTth(N-D) E507Q. TaqTth(D-B)W417L/G418K was 
cut with the restriction enzymes Ndel and BamHI, and the larger, vector fragment was 
isolated as detailed in Example 3. The TaqTth(N-D) E507Q construct was also cut with Ndel 
and BamHI and the smaller (approximately 795 base pairs) fragment was gel isolated and 
purified as detailed in Example 3. The Ndel-BamHI insert was ligated into the gel purified 
vector, as detailed in Example 3. 

17. Construction of TaqDN RX HT W417L/E507Q (DNA sequence SEQ ID 
NO:149; amino acid sequence SEQ ID NO:150) 

Starting with TaqDN RX HT W417L/G418K/E507Q described above, mutagenic 
primer 337-01-02: 5'-TTC GCC AAC CTG CTT GGG AGG CTT GAG GGG GAG -3' 
(SEQ ID NO: 151) was used in a site specific mutagenesis reaction to change the K at amino 
acid position 418 back to the wild-type amino acid, G. Site specific mutagenesis was done 
using the Transformer Site Directed Mutagenesis Kit (Clonetech) according to the 
manufacturer's instructions, and as described in Experimental Example 4. 

18. Construction of TaqDN RX HT G418K/E507Q (DNA sequence SEQ ID 
NO:152; amino acid sequence SEQ ID NO:153) 

Starting with TaqDN RX HT W417L/G418K/E507Q described above, mutagenic 
primer 337-01-01: 5'-CTC TTC GCC AAC CTG TGG AAG AGG CTT GAG GGG -3' was 
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used in a site specific mutagenesis reaction to change the L at amino acid position 417 back 
to the wild-type amino acid, W. Site specific mutagenesis was done using the Transformer 
Site Directed Mutagenesis Kit (Clonetech) according to the manufacturer's instructions;, and as 
described in Experimental Example 4. 

Expression and purification of mutant proteins was done as detailed in Example 3, and 
the cleavage activities of these proteins were characterized as describe in Example 1 ? part A. 
A comparison of the cleavage cycling rates of a selection of these mutant proteins on an RNA 
target is shown in Figure 14. As further discussed in the Description of the Invention, these 
data show that amino acids in the regions 417/418 and amino acid 507 are important in the 
conferring the TthPol-like RNA-dependent cleavage activity on the chimerical proteins 
comprising portions of TaqPol in combination with portions of TthPol that are not 
independently capable of providing enhanced RNA dependent activity (i.e., the D-B and N-D 
portions of Tth). As described in the Description of the Invention, Taq DN RX HT variant 
carrying only the W417L, G418K and E507Q substitutions were created. By comparing their 
cleavage rates to that of Tth DN RX HT on the IL-6 RNA substrate as described in Example 
1, these mutations were determined to be sufficient to increase the Taq DN RX HT activity to 
the Tth DN RX HT level. Figure 15 shows that the Taq DN RX HT W417L/G418K/E507Q 
and Taq DN RX HT G418K/E507Q mutants have 1.4 times higher activity than Tth DN RX 
HT and more than 4 fold higher activity than Taq DN RX HT, whereas the Taq DN RX HT 
W417L/E507Q mutant has the same activity as the enzyme, which is about 3 fold higher than 
Taq DN RX HT. These results demonstrate that K4 18 and Q5 07 of TthPol are particularly 
important amino acids in providing RNA dependent 5' nuclease activity that is enhanced 
compared to TaqPol. 

EXAMPLE 6 

RNA-dependent 5' nuclease properties of the Taq DN RX HT G418K/E507Q 5' nuclease 
are similar to Tth DN RX HT with respect to salt and temperature optima 

To determine if the G418K/E507Q mutations caused any significant changes in the 
properties of the Taq DN RX HT mutant in addition to the increased cleavage rate with the 
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RNA target, the Taq DN RX HT G418K/E507Q (SEQ ID NO: 153), Taq DN RX HT (SEQ 
ID NO:73), and Tth DN RX HT (SEQ ID NO:72) enzymes were compared in the RNA 
template dependent 5' nuclease assay under conditions where temperature and the 
concentrations of salt and divalent ions were varied. The upstream DNA and the template 
5 RNA strands of the substrate used in this study were linked into a single IrT molecule (SEQ 
ID NO:27) as shown in Figure 20 A, and the labeled downstream probe (SEQ ID NO:26) was 
present in large excess. The 5' end of the target RNA strand was blocked with a 
biotin-streptavidin complex to prevent any non-specific degradation by the enzyme during the 
reaction (Lyamichev et aL, Science 260:778 [1993], Johnson et aL, Science 269:238 [1995]). 
10 The cleavage rates for Taq DN RX HT G418K/E507Q, Taq DN RX HT, and Tth DN RX HT 
are plotted as functions of temperature in Figure 20B. The closed circles represent enzyme 
Taq DN RX HT, the open circles represent enzyme Tth DN RX HT, and the Xs represent 
Jj enzyme Taq DN RX HT G418K/E507Q. The difference in the activities of Tth and Taq DN 

J* j RX HT enzymes with the IrT substrate is even greater than the difference found with the IL-6 

Hi 5 RNA substrate when tested in a cleavage assay as described in Example 1. The 
m G418K/E507Q mutations increase the activity of the Taq enzyme more than tenfold and by 

+ 25% compared with the Tth enzyme. All three enzymes show a typical temperature profile of 

O the invasive signal amplification reaction and have the same optimal temperature. No 

Sj significant effect of G418K/E507Q mutations on DNA dependent 5 ? nuclease activity of Taq 

#0 DN RX HT with the all-DNA substrate analogous to IrT (SEQ ID NO:28) under the same 
Q conditions was found. 

The effects of KC1 and MgS0 4 concentrations on the 5' nuclease activity of Taq DN 
RX HT G418K/E507Q, Taq DN RX HT, and Tth DN RX HT with the IrT substrate are 
shown in Figure 20C and D. The activities of all enzymes have similar salt dependencies 
25 with an optimal KC1 concentration of 100 mM for Taq DN RX HT G418K/E507Q and Tth 
DN RX HT and 50 mM for Taq DN RX HT. The optimal MgS0 4 concentration for all 
enzymes is approximately 8 mM. The analysis of the data presented in Figure 20 suggests 
that the properties of Taq DN RX HT G418K/E507Q are much closer to those of Tth DN RX 
HT rather than Taq DN RX HT confirming the key role of the G418K/E507Q mutations in 
30 the recognition of the substrate with an RNA target. 
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To understand the mechanism of the reduction of the 5' nuclease activity in the 
presence of an RNA versus a DNA target, the Michaelis constant (KJ and the maximal 
catalytic rate (k cat ) of all three enzymes were determined, using an excess of the IrT substrate 
(SEQ ID NO:27) and the downstream probe (SEQ ID NO:26) and a limiting enzyme 
5 concentration. For these measurements, ten-ul reactions were assembled containing 10 mM 
MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 ng/ml tRNA, 4 mM MgCl 2 , 1 nM 
of enzyme (Taq DN RX HT, Tth DN RX HT, or Taq DN RX HT G418K/E507Q) and 
different concentrations (0.125, 0.25, 0.5 or 1 uM) of an equimolar mixture of the IrT target 
and the downstream probe. The cleavage kinetics for each enzyme and each substrate 
10 concentration were measured at 46°C. Reactions were stopped by the addition of 10 ul of 

95% formamide containing 10 mM EDTA and 0.02% methyl violet (Sigma). One ul of each 
stopped reaction digest was fractionated on a 20% denaturing acrylamide gel (19:1 
25 cross-linked), with 7M urea, and in a buffer of 45 mM Tris-borate, pH 8.3, 1.4mM EDTA. 
Jf; Gels were scanned on an FMBIO-100 fluorescent gel scanner (Hitachi) using a 585 nm filter. 
M5 The fraction of cleaved product (determined from intensities of bands corresponding to uncut 
p and cut substrate with FMBIO Analysis software, version 6.0, Hitachi) was plotted as a 

^ function of reaction time. The initial cleavage rates were determined from the slopes of linear 

O part of the cleavage kinetics and were defined as the concentration of cut product divided by 

fy the enzyme concentration and the time of the reaction (in minutes). The Michaelis constant 

2120 K m and the maximal catalytic rate k cat of each enzyme with IrT substrate were determined 
O from the plots of the initial cleavage rate as functions of the substrate concentration. 

It was found that all three enzymes have similar values (in the range of 200-300 
nM) and values of approximately 4 min 1 for Taq DN RX HT and Tth DN RX HT and of 
9 mm 1 for Taq DN RX HT G418K/E507Q. That the G418K/E507Q mutations increase the 
25 k cat of Taq DN RX HT more than two fold, but have little effect on suggest that the 

mutations position the substrate in an orientation more appropriate for cleavage, rather than 
simply increase the binding constant. 



30 
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EXAMPLE 7 

Use of molecular modeling to further improve RNA-dependent 5' nuclease activity 



A. Point mutants 

To develop enzymes with altered function, sequence changes were introduced by site 
specific mutagenesis in predetermined locations or by random mutagenesis. Locations for site 
specific mutagenesis were chosen based on evidence from chimeric studies, relevant published 
literature, and molecular modeling. Seven additional mutant enzymes were developed from 
the Tth DN RX HT enzyme, and twenty additional mutant enzymes were developed from the 
Taq DN RX HT enzyme, both discussed previously. Some of the mutant enzymes are the 
result of multiple mutagenesis reactions, that is, more than one change has been introduced to 
obtain the final product. Mutation reactions were done using the Tth DN RX HT construct 
(SEQ ID NO:70) described in Example 2C2, or the Taq DN RX HT construct (SEQ ID 
NO:71), described in Example 2C1 unless otherwise stated. Plasmid DNA was purified from 
200 ml of JM109 overnight culture using QIAGEN Plasmid Maxi Kit (QIAGEN, Chatsworth, 
CA) according to the manufacturer's protocol to obtain enough starting material for all 
mutagenesis reactions. All site specific mutations were introduced using the Transformer Site 
Directed mutagenesis Kit (Clontech) according to the manufacturer's protocol. One of two 
different selection primers, Trans Oligo AlwNI/Spel or Switch Oligo Spel/AlwNI (Clontech, 
Palo Alto CA catalog #6488-1 or catalog #6373-1) was used for all mutagenesis reactions 
described. The selection oligo used in a given reaction is dependent on the restriction site 
present in the vector. All mutagenic primers for both the site specific mutagenesis and the 
random mutagenesis were synthesized by standard synthetic chemistry. Resultant colonies for 
both types of reactions were E.coli strain JM109. Random mutagenesis methods are described 
below. 

Mutants were tested via the rapid screening protocol detailed in Example 1. Then, if 
more detailed analysis was desired, or if a larger protein preparation was required, expression 
and purification of mutant proteins was done as detailed in Example 3. 
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1. Construction of Tth DN RX HT H641A, Tth DN RX HT H748A, Tth DN 
RX HT H786A 

Site specific mutagenesis was performed on pTrc99A Tth DN RX HT DNA using the 
mutagenic primer 583-001-02: 5 '-get tgc ggt ctg ggt ggc gat gtc ctt ccc ctc-3' (SEQ ID 
NO:158) to introduce the H641A mutation (DNA sequence SEQ ID NO:156; amino acid 
sequence SEQ ID NO:157), or the mutagenic primer 583-001-03: 5' cat gtt gaa ggc cat ggc 
etc cgc ggc etc cct-3' (SEQ ID NO: 161) to generate the H748A mutant (DNA sequence SEQ 
ID NO:159; amino acid sequence SEQ ID NO:160), or the mutagenic primer 583-001-04: 
5'-cag gag gag etc gtt ggc gac ctg gag gag-3' (SEQ ID NO:164) to generate the H786A 
mutant enzyme (DNA sequence SEQ ID NO: 162; amino acid sequence SEQ ID NO: 163). 

2. Construction of Tth DN RX HT (H786A/G506K/Q509K) 

Starting with the mutant Tth DN RX HT H786A, generated above, site specific 
mutagenesis was done using the mutagenic primer 604-022-02: 5'-gga gcg ctt gec tgt ctt ctt 
cgt ctt ctt caa ggc ggg agg cct-3' (SEQ ID NO: 167) to generate this variant termed 
"TthAKK", (DNA sequence SEQ ID NO:165; amino acid sequence SEQ ID NO:166). 

3. Construction of Taq DN RX HT (W417L/G418K/E507Q/H784A) 
Mutagenic oligonucleotide 158-029-02: 5'-gag gac cag etc gtt ggc gac ctg aag gag 

cat-3' (SEQ ID NO: 170) was used in a site specific mutagenesis reaction to introduce the 
H784A mutation and generate this construct termed "Taq4M" (DNA sequence SEQ ID 
N0.168; amino acid sequence SEQ ID NO:169). 

4. Construction of Taq4M H639A, Taq4M R587A, Taq4M G504K and 
Taq4M G80E 

Site specific mutagenesis was done on the Taq4M mutant, using primer 473-010-11: 
5'-gaggggcgggacatcgccacggagaccgccagc-3' (SEQ ID NO: 173) to generate the Taq 4M H639A 
mutant (DNA sequence SEQ ID NO: 171; amino acid sequence SEQ ID NO: 172), primer 
473-010-10: 5'-cag aac ate ccc gtc gec acc ccg ctt ggg cag-3' (SEQ ID NO:176) to generate 
Taq 4M R587A (DNA sequence SEQ ID NO: 174; amino acid sequence SEQ ID NO: 175), 
primer 300-081-06: 5'-ggg ctt ccc gec ate aag aag acg gag aag acc-3' (SEQ ID NO:179) to 
generate Taq 4M G504K (DNA sequence SEQ ID NO: 177; amino acid sequence SEQ ID 
NO: 178), and primer 330-088-04: 5'-cta ggg ctt ccc gec ate aag aag acg caa aag acc ggc-3' 
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(SEQ ID NO: 182) to generate the Taq 4M G80E mutant (DNA sequence SEQ ID NO: 180; 
amino acid sequence SEQ ID NO:181). 

5. Construction of Taq 4M P88E/P90E and Taq 4M L109F/A110T 
Starting with Taq 4M described above, site specific mutagenesis was done using 
5 primer 473-087-03: 5'-ccg ggg aaa gtc etc etc cgt etc ggc ccg gec cgc ctt-3' (SEQ ID 

NO: 185) to generate the P88E/P90E mutations (DNA sequence SEQ ID NO: 183; amino acid 
sequence SEQ ID NO: 184), or primer 473-087-05: 5 '-egg gac etc gag gcg cgt gaa ccc cag 
gag gtc cac-3' (SEQ ID NO: 188) to generate the L109F/A110T mutations (DNA sequence 
SEQ ID NO: 186; amino acid sequence SEQ ED NO: 187). 
10 6. Construction of Taq DN RX HT (W417L/G418K/G499R/A502K/I503L 

/G504K/ E507K/H784A) 
Two PCR reactions were performed, first using construct Taq4M (Taq 
JJ W417L/G418K/G504K/E507Q/H784A) as a template. Using primers 158-84-01 
if] 5'-CTCCTCCACGAGTTCGGC-3' (SEQ ID NO:191) and 535-33-02 5'-ACC GGT CTT 

H5 CTT CGT CTT CTT CAA CTT GGG AAG CCT GAG CTC GTC AAA-3' (SEQ ID 
O NO: 192) a 620 base pair PCR fragment was generated. Another 510 base pair PCR product 

f was generated using primer 535-33-01 5'-AAG ACG AAG AAG ACC GGT AAG CGC TCC 
O ACC AGC-3' (SEQ ID NO: 193) and 330-06-03 5'-GTC GAC TCT AGA TCA GTG GTG 
fy GTG GTG GTG GTG CTT GGC CGC CCG GCG CAT C-3' (SEQ ID NO: 194). The two 
120 PCR products overlap such that a final recombinant PCR amplification was done using the 
CI outside primers 158-84-01 and 330-06-03 to yield the 1182 base pair product. The 

recombinant PCR product was digested with the restriction enzymes NotI and BamHI 
according to the manufacturer's instructions to yield a 793 base pair fragment. The parent 
plasmid Taq4M was also digested with the same enzymes and used as the vector for ligation. 
25 All DNA fragments were TAE agarose gel purified prior to ligation. The fragment was 
ligated into the vector, and transformed into JM109 cells, thus incorporating the mutations 
G499R, A502K, I503L, and E507K as well as the restriction endonuclease site, Agel. This 
construct is termed "Taq 8M" (DNA sequence SEQ ID NO: 189; amino acid sequence SEQ ID 
NO: 190). 
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B. Random Mutagenesis 

Numerous enzymes with altered function were generated via random mutagenesis. 
The regions of the protein targeted for random mutagenesis were chosen based on molecular 
modeling data and from information in the literature. Different mutagenic primers were used 
to introduce mutations into different regions of the protein. Random mutagenesis was 
performed on the Taq variant Taq 4M G504K (Taq DN RX HT 

W417L/G418K/G504K/E507Q/H784A/) (SEQ ID NO:177) described above and mutant PCR 
fragments generated in the mutagenesis reaction were exchanged for homologous regions in 
Taq8M (SEQ ID NO: 189) unless otherwise stated. 

Random mutagenesis was also performed on the Tth DN RX HT H786A (SEQ ID 
NO: 162) described above. Mutant PCR fragments generated with the Tth DN RX HT H786A 
template were exchanged for homologous regions in the unaltered Tth DN RX HT H786A. 

1. Random mutants in amino acid residues 500-507 or 513-520 

The first mutagenic oligonucleotide, 535-054-01: 5'-gga gcg ctt acc ggt ctt (ttg cgt ctt 
ctt gat ctt ggg aag) cct tag etc gtc aaa gag-3' (SEQ ID NO: 195) was used in conjunction with 
158-84-01: 5'-CTC CTC CAC GAG TTC GGC-3' (SEQ ID NO:196) to install random 
residues from amino acid position 500 to 507 of Taq polymerase variant Taq DN RX HT 
W417L/G418K/G504K/E507Q/H784A (SEQ ID NO:177). This was accomplished by 
synthesizing the primer 535-054-01 such that only 91% of the bases within the parenthesis are 
unaltered while the remaining 9% of the bases are an equal mixture of the other 3 nucleotides. 
The initial, unaltered sequence of this oligo includes the G499R, A502K and the Q507K 
changes. 

To generate mutations in the region 500-507, primer 535-054-01 and primer 158-84-01 
were used in a PCR reaction, using the Advantage cDNA PCR kit (Clonetech) and Taq 
variant described above, as the target. This PCR fragments was then run on a 1% TAE 
agarose gel, excised and purified with QIAquick Gel Extraction Kit (Qiagen, Valencia CA, 
catalog # 28706). The purified fragment was cut with NotI and Agel and ligated into 
pTaq8M that had been linearized with NotI and AgeL JM109 E.coli cells (Promega) were 
transformed with the ligated products. Clones were tested as described below. 
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The second mutagenic oligonucleotide (used in a separate reaction) 535-054-02: 5'-caa 
aag acc ggt aag cgc (tec acc age gec gec gtc ctg gag) gec etc cgc gag gec cac-3' (SEQ ID 
NO: 197) was used in conjunction with 330-06-03: 5' -GTC GAC TCT AGA TCA GTG GTG 
GTG GTG GTG GTG CTT GGC CGC CCG GCG CAT C-3' (SEQ ID NO: 198) to install 
random residues from amino acid 513-520. The bases within the parenthesis of primer 
535-054-02 are also 91% wild-type and 3% each of the other 3 nucleotides. 

To generate mutations in the region 513-520, primer 535-054-02 and primer 
535-054-02 were used in a PCR reaction with Taq DN RX HT W417L/G418K/G504K/ 
E507Q/H784A (SEQ ID NO: 177) as template, as described above. The resulting PCR 
fragment was purified as above and cut with the restriction enzymes Agel and BamHI. The 
cut fragment was then ligated into the Taq8M construct, also linearized with Agel and 
BamHI. JM109 E.coli cells were transformed with the ligated products. Clones were tested 
as described Example 1. Mutants developed from these include: 

Taq DN RX HT W417L/G418K/G499R/A502K/K504N/E507K/H784A (Ml-13) (DNA 
sequence SEQ ID NO: 199; amino acid sequence SEQ ID NO:200). 
Taq DN RX HT W417L/G418K/G499R/L500I/A502K/G504K/Q507H/H784A (Ml -36) 
(DNA sequence SEQ ID NO:201; amino acid sequence SEQ ID NO:202). 
Taq DN RX HT W417L/G418K/G499R/A502K/I503L/G504K/E507K/T514S/H784A (M2-24) 
(DNA sequence SEQ ID NO:203; amino acid sequence SEQ ID NO:204). 
Taq DN RX HT W417L/G418K/G499R/A502K/I503L/G504K/E507K/ V518L/H784A 
(M2-06) (DNA sequence SEQ ID NO:205; amino acid sequence SEQ ID NO:206). 
2. TthDN RX HT H786A random mutagenesis 

To generate mutants in the helix-hairpin-helix region of the TthDN RX HT H786A 
(SEQ ID NO: 163) enzyme, two different PCR reactions were performed using the H786A 
(SEQ ID NO: 162) mutant as a template. The two PCR products overlap such that a 
recombinant PCR reaction can be performed (Higuchi, in PCR Technology, H. A. Erlich, ed., 
Stockton Press, New York. pp61-70 [1989]). This final PCR product is then exchanged with 
the homologous region of the TthDN H786A mutant by using restriction enzyme sites located 
on the ends of the fragment and within the TthDN H786A sequence. 
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Starting with TthDN H786A discussed above, and using primer 604-08-06: 5 '-gtc gga 
ggg gtc ccc cac gag-3' (SEQ ID NO:207) and primer 390-76-08: S'-tgt gga att gtg age gg 
(SEQ ID NO:208), a 620 base pair PCR fragment was generated. PCR reactions were 
performed using the Advantage cDNA PCR kit (Clontech) according to manufacturer's 
instructions. This PCR product includes amino acids 1-194. No mutations were introduced 
via this reaction, however the restriction enzyme site EcoRI is present at the 5' end. 

Starting with TthDN RX HT H786A discussed above, and using mutagenic primer 
604-08-05: 5' -etc gtg ggg gac ccc tec gac aac etc (ccc ggg gtc aag ggc ate ggg gag aag acc 
gec) etc aag ctt etc aag-3 J (SEQ ID NO:209) and primer 209-74-02: 5 ? -gtg gec tec ata tgg gec 
agg ac-3' (SEQ ID NO:210) a 787 base pair PCR fragment was generated. PCR reactions 
were done as above. This fragment does contain random mutations, due to the presence of the 
mutagenic primer, 604-08-05. The bases within the parenthesis of this primer were 
synthesized such that 91% of the sequence is wild-type, while the additional 9% is evenly 
divided between the remaining 3 bases. 

The two PCR fragments overlap, and were combined in a recombinant PCR reaction. 
Primers 390-76-08 and 209-74-02 were added, and the Advantage cDNA PCR kit (Clontech) 
was again used according to manufacturer's instructions. A 1380 base pair product was 
generated from this reaction. 

The recombinant PCR product was cut with the restriction enzymes EcoRI and NotI 
according to the manufacturer's instructions to yield a 986 base pair fragment. TthDN RX 
HT H786A was prepared by cutting with the same enzymes. The fragment was then ligated 
into the vector, and transformed into JM109 cells. New mutants developed from this set of 
reactions include: 

TthDN RX HT H786A/P197R/K200R (DNA sequence SEQ ID NO:211; amino acid sequence 
SEQ ID NO:212). 

TthDN RX HT H786A/K205Y (DNA sequence SEQ ID NO:213; amino acid sequence SEQ 
ID NO:214). 

TthDN RX HT H786A/G203R (DNA sequence SEQ ID NO:215; amino acid sequence SEQ 
ID NO:216). 
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3. Construction of Taq DN RX HT W417L/G418K/H784A 
L109F/A110T/G499R/A502K/I503L/G504K/E507K/T514S (Taq SS) 

Starting with Taq DN RX HT 

W417L/G418K/G499R/A502K/I503L/G504K/E507K/T514S/H784A (SEQ ID NO:203) mutant 
described above, primer 473-087-05: 5 '-egg gac etc gag gcg cgt gaa ccc cag gag gtc cac-3' 
(SEQ ID NO:219) was used in conjunction with the appropriate selection primer in a site 
specific mutagenesis reaction to incorporate the L109F and A110T mutations to generate this 
enzyme, termed "TaqSS" (DNA sequence SEQ ID NO:217; amino acid sequence SEQ ID 
NO:218). 

4. Construction of Taq DN RX HT W417L/G418L/H784A 
P88E/P90E/G499R/A502K/I503L/G504K/E507K/T514S 

Starting with Taq DN RX HT 

W417L/G418K/G499R/A502K/I503L/G504K/E507K/T514S/H784A (SEQ ID NO:203) mutant 
described above, primer 473-087-03: 5'-ccg ggg aaa gtc etc etc cgt etc ggc ccg gec cgc ctt-3' 
(SEQ ID NO:222) was used in conjunction with the appropriate selection primer in a site 
specific mutagenesis reaction to incorporate the P88E and P90E mutations to generate this 
enzyme (DNA sequence SEQ ID NO:220; amino acid sequence SEQ ID NO:221). 
5. TaqSS random mutagenesis 

Random mutagenesis was used to introduce additional changes in the 
helix-hairpin-helix domain of the TaqSS mutant (SEQ ID NO:217). The mutagenesis was 
done as described in example 9 above. In the first step, two different but overlapping PCR 
products were generated. One of the PCR products, generated with oligos 390-76-08 (SEQ 
ID NO:208), and 604-08-04: 5'-gtc gga etc gtc acc ggt cag ggc-3' (SEQ ID NO:223) 
incorporates the EcoRI site into the fragment, but does not incorporate any mutations. The 
second PCR product utilizes mutagenic primer 604-08-03: 5'-ctg acc ggt gac gag tec gac aac 
ctt (ccc ggg gtc aag ggc ate ggg gag aag acg gcg) agg aag ctt ctg gag-3' (SEQ ID NO:224) 
and primer 209-74-02 (SEQ ID NO:210). This fragment contains random point mutations, 
and when combined via recombinant PCR with the first fragment, can be cut with the 
restriction enzymes EcoRI and NotI, and ligated into the TaqSS construct, also cut with 
EcoRI and NotI. The ligated construct was then transformed into JM109. Colonies were 
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screened as described below. Enzymes developed from this mutagenesis include: 

TaqSS K198N (DNA sequence SEQ ID NO:225; amino acid sequence SEQ ID NO:226). 
TaqSS A205Q (DNA sequence SEQ ID NO:227; amino acid sequence SEQ ID NO:228). 
5 TaqSS I200M/A205G (DNA sequence SEQ ID NO:229; amino acid sequence SEQ ID 
NO:230). 

TaqSS K203N (DNA sequence SEQ ID NO:231; amino acid sequence SEQ ID NO:232). 

TaqSS T204P (DNA sequence SEQ ID NO:233; amino acid sequence SEQ ID NO:234). 
6. Construction of TaqSS R677A 
10 To generate enzymes with sequence changes in both the arch region and in the 

polymerase region, additional specific point mutations were generated in TaqSS. Site specific 

mutagenesis was performed as described above using the oligo 473-060-10: 5 '-tag etc ctg gga 
J gag ggc gtg ggc cga cat gcc-3' (SEQ ID NO:237) to generate the TaqSS R677A mutant 
U] (DNA sequence SEQ ID NO:235; amino acid sequence SEQ ID NO:236). 
15 7. Construction of TaqTthAKK (DNA sequence SEQ ID NO:238; amino acid 

g sequence SEQ ID NO:239) and TthTaq5M (DNA sequence SEQ ID NO:240; amino acid 
4 sequence SEQ ID NO:241) 

O Chimeric mutant TaqTthAKK and TthTaq5M were generated by cutting Tth DN RX 

;)[ HT (H786A/G506K/Q509K) (SEQ ID NO: 165; here abbreviated TthAKK) or Taq 4M G504 
jlO (SEQ ID NO: 177; here abbreviated Taq 5M) with the restriction endonucleases EcoRI and 
Q Notl. The smaller insert fragments as well as the larger vector fragments were gel purified as 
detailed in Example 3D, and the insert fragments were exchanged between the two mutants 
and ligated as described in Example 3D. Screening and verification of the construct sequence 
was also done as in Example 3D. 

25 

EXAMPLE 8 

Improvement of RNA-dependent 5' nuclease activity in other polymerases 

Information gained from the TaqPol/TthPol recombinations, mutagenesis and 
30 modeling, was used to make comparable mutations in additional DNA polymerases and 
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examined the effects on the cleavage activities of these enzymes. The DNA polymerases of 
Thermus filiformus (TfiPol) and Thermus scotoductus (TscPol) were cloned and purified as 
described in Example 2. The mutagenesis of these two proteins is described below. 
A. Construction of TfiPolDN2M 
5 Mutagenesis of pTrc99a-TfiPol (SEQ ID NO:48) was done using the QuikChange 

site-directed mutagenesis kit (Stratagene) according to the manufacturer's protocol. The 
P420K mutation was made with the following two oligonucleotides; 
5 ' -CTTCCAGAACCTCTTT AAACGGCTTTCCGAGAAG (SEQ ID NO:244) and 
5 ' -CTTCTCGGAAAGCCGTTT AAAGAGGTTCTGGAAG (SEQ ID NO:245). The E507Q 

10 mutation was made with the following two oligonucleotides; 

5 ' -CCGGTGGGCCGGACGCAGAAGACGGGC AAGC (SEQ ID NO:246) and 
5'-GCTTGCCCGTCTTCTGCGTCCGGCCCACCGG (SEQ ID NO:247). The D785N 

y?j mutation was made with the following two oligonucleotides; 

5 ' -CTCCTCC AAGTGC AC AACGAGCTGGTCCTGG (SEQ ID NO:248) and 

4J5 5 ' -CCAGGACCAGCTCGTTGTGCACTTGGAGGAG (SEQ ID N0.249). The plasmid 

Pi containing all three mutations is called pTrc99a-TfiPolDN2M, (DNA sequence SEQ ID 

# NO:242; amino acid sequence SEQ ID NO:243). 

O B. Construction of TscPolDN2M 

fjj Mutagenesis of pTrc99a-TscPol (SEQ ID NO:51) was done with the QuikChange 

;|0 site-directed mutagenesis kit (Stratagene) according to the manufacturer's protocol. The 
O E416K mutation was made with the following two oligonucleotides; 

5 ' -GCCGCCCTCCTGAAGCGGCTT AAGGG (SEQ ID NO:252) and 

5 ' -CCCTT AAGCCGCTTCAGGAGGGCGGC (SEQ ID NO:253). The E505Q mutation was 

made with the following two oligonucleotides; 
25 5 ' - ATCGGCAAGACGCAGAAGACGGGCAAGC (SEQ ID NO:254) and 

5 ' -GCTTGCCCGTCTTCTGCGTCTTGCCGAT (SEQ ID NO:255). The D783N mutation 

was made with the following two oligonucleotides; 

5 ' -TTGCAGGTGCACAACGAACTGGTCCTC (SEQ ID NO:256) and 

5 ' -GAGGACCAGTTCGTTGTGC ACCTGC AA (SEQ ID NO:257). The plasmid containing 
30 all three mutations is called pTrc99a-TscPolDN2M, (DNA sequence SEQ ID NO:250; amino 
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acid sequence SEQ ID NO:251). 

C. Chimerics of Tsc, Tfi, Tth and Taq mutants 

1. Construction of TfiTth AKK (DNA sequence SEQ ID NO:258; amino acid 
sequence SEQ ID NO:259), TscTthAKK (DNA sequence SEQ ID NO:260; 
amino acid sequence SEQ ID NO:261), TfiTaqSM (DNA sequence SEQ ID 
NO:262; amino acid sequence SEQ ID NO:263) and TscTaqSM (DNA 
sequence SEQ ID NO:264; amino acid sequence SEQ ID NO:265) 
To generate chimeric enzymes between Tth DN RX HT (H86A/G506K/Q509K) (here 
abbreviated TthAKK, SEQ ID NO: 165) or Taq 4M G504 (here abbreviated Taq 5M ? SEQ ID 
NO: 177), and Tfi DN 2M (SEQ ID NO:242), or Tsc DN 2M (SEQ ID NO:250), additional 
restriction endonuclease sites were introduced by site specific mutagenesis into the named Tfi 
and Tsc mutants. Mutagenic primers 700-011-01 5'-cag acc atg aat tec acc cca ctt ttt gac ctg 
gag-3 ? (SEQ ID NO:275) and 700-011-02 5'-gtg gac gcg gec gec cga ggc cgc cgc cag ggc 
cag-3 ? (SEQ ID NO:276) were used to introduce an EcoRI site at amino acid position 1 and a 
NotI site at amino acid position 331 in Tfi DN 2M. Mutagenic primers 700-011-03 5'-cag 
acc atg aat tec ctg ccc etc ttt gag ccc aag-3' (SEQ ID NO:277) and 700-011-04 5'-gta aac cgc 
gec gec cca ggc ggc ggc caa ggc gtt-3' (SEQ ID NO:278) were used to introduce an EcoRI 
site at amino acid position 1 and a NotI site at amino acid position 327 in Tsc DN 2M. PCR 
reactions were done using the Advance cDNA PCR kit (Clonetech) according to 
manufacturer's instructions and either Tfi DN 2M or Tsc DN 2M as target, with their 
corresponding primers. The 1017 base pair PCR products were cut with both EcoRI and NotI 
to yield 993 base pair insert fragments that were gel purified as described in Example 3D. 
The mutants Taq4M G504K (SEQ ID NO: 177) and Tth DN RX HT (H786A/G506K/Q509K) 
(SEQ ID NO: 165) were also cut with EcoRI and NotI, and the larger, vector fragment was gel 
isolated as above. Ligations were performed as detailed in Example 3D, as was the 
screening and verification of the new constructs. 
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TABLE 1 



Klenow Kcat ( s 1 ) 



Wild-Type 
S610A 
R668A 
N678A 
E710A 
E710D 
K758A 
K758R 
Y766S 
R841A 
N845A 
N845Q 
Q849A 
Q849E 
H881A 
D882N 
D882S 



2.4 
n. d. 
0.006 
n. d. 
0.1 
1.7 
0.131 
2.0 
0.8 
0.3 
1.0 
0.03 
0.02 
0.001 
0.3 
O.0001 
0.001 



Km(dNTP) 

2.8 

6.5 

15 
7.7 
15.6 
2.1 
6.4 
9.8 
23 
1.7 
3.8 
n. d. 
3.3 
n. d. 
7.5 



KdfaM) 
8 

n. d. 

140, 150 
n. d. 
250 
110 



13 
40, 53 

8,5 
80,55 
100, 160 
90,91 
20, 28 

30 

0.9 



Relative DNA 
affinity 
1 

0.06, 0.05 

0.03 
0.07 
0.63 
1.125 
0.4, 0.6 

0.2 
1.0, 1.7 
0.1,0.2 
0.08, 0.05 
0.09 
0.4, 0.3 
0.6 
9 



Reference Tag Pol 



2 
5 

1,2 
5 
2 
2 
4 
4 

1,2 
1,2 
1,2 
1,2 
1,2 
1,2 
1,2 
2 
2 



Wild-Type 
S515 
R573* 
N583 
E615* 
E615* 
K663 
K663 
Y671 
R746* 
N750* 
N750* 
Q754 
Q754 
H784* 
D785 
D785 



References: 

1. JBC (1990) 265:14579-14591 

2. JBC (1992) 267:8417-8428 

3. Eur. J. Biochem (1993) 214:59-65 

4. JBC (1994) 269:13259-13265 

5. Nature (1996) 382:278-281 
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TABLE 2: Rational mutations in the polymerase region 
A. DNA activity table 





IdT 


%Tth %Taq4M 


[ HP 


X 


Tth DN RX HT 


31.91 


100% 


83% 


3.81 


101.9 


Tth DN RX HT 


23.61 


74% 


62% 


5.32 


221.24 


H641A 












Tth DN RX HT 


22.1 


69% 


58% 


4.39 


88.17 


R748A 












Tth DN RX HT 


34.31 


108% 


90% 


7.75 


185.35 


H786A 












Tth DN RX HT 


32.1 


101% 


84% 


5.7 


332.8 


H786A/G506K/ 












Q509K (AKK) 












TaqDNRXHT 


38.23 


120% 


100% 


68.21 


1100.18 


W417L/G418K/ 












E507Q/H784A 
(Taq 4M) 












Taq 4M G504K 


36.04 


113% 


94% 


31.76 


417.40 


Taq 4M H639A 


42.95 


135% 


112% 


91.46 


2249.67 


Taq 4M R587A 


44.78 


140% 


117% 


143.0 


252.69 


Taq DN RX HT 


43.95 


138% 


115% 


122.53 


346.56 


W417L/G418K7 












G499R/A502K/ 












I503L/K504N/ 












E07K/H784A 












(Taq8M) 












TaqSS R677A 


32.3 


101% 


84% 


206.9 


2450.0 
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B. RNA activity table 

IrTl %Tth %Taq4M 



TthDNRXHT 


0.89 


100% 


34% 


TthDNRXHT 


1.18 


133% 


45% 


H641A 








TthDNRXHT 


1.34 


151% 


51% 


R748A 








TthDNRXHT 


1.31 


147% 


49% 


H786A 








TthDNRXHT 


1.59 


179% 


60% 


H786A/G506K/ 








Q509K (AKK) 








TaqDNRXHT 


2.65 


298% 


100% 


W417L/G418K/ 








E507Q/H784A 
(Taq 4M) 








Taq 4M G504K 


2.76 


310% 


114% 


Taq 4M H639A 


3.89 


437% 


147% 


Taq 4M R587A 


3.13 


352% 


118% 


TaqDNRXHT 


4.00 


450% 


151% 


W417L/G418K/ 








G499R/A502K/ 








I503L/K504N/ 








E07K/H784A 








(Taq8M) 








TaqSS R677A 


2.22 


249% 


84% 



TABLE 3: Rational arch mutations 
DNA activity table 





IdT 


%Tth 


%Taq4M 


HP 


X 


Taq4M 


10.20 


32% 


27% 


2.00 


97.00 


P88E/P90E 












Taq4M 


26.30 


82% 


69% 


103.6 


2900 


G80E 












Taq4M 


36.45 


114% 


95% 


19.71 


749.69 


L109F/A110T 













RNA activity table 



IrTl %Tth %1 


raq4M 


Taq4M 
P88E/P90E 


Taq4M 
P88E/P90E 


0.10 


11% 


4% 


Taq4M 
G80E 


Taq4M 
G80E 


3.11 


349% 


117% 


Taq4M 
L109F/A110T 


Taq4M 
L109F/A110T 


2.45 


275% 


92% 
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TABLE 4: Arch/thumb combinations 



DNA activity table 

IdT %Tth %Taq4M HP X 



TaqW417L/ 

G418K/E507K/ 

H784A/L109F/ 

A110T/G499R/ 

A502K/I503L/ 

G504K/E507K/ 

T514S(TaqSS) 


63.33 


198% 


166% 


177.05 


202.32 


TaqP88E/P90E/ 

W417L/G418K/ 

G499R/A502K/ 

I503L/G504K/ 

E507K/T514S/ 

H784A 


36.48 


114% 


95% 


9.44 


70.35 


RNA activity table 

IrTl %Tth %Taq4M 




Taq W417L/ 

G418K/E507K7 

H784A/L109F/ 

A110T/G499R/ 

A502K/I503L/ 

G504K/E507K/ 

T514S (Taq SS) 


3.16 


355% 


119% 




TaqP88E/P90E/ 

W417L/G418K/ 

G499R/A502K/ 

I503L/G504K/ 

E507K/T514S/ 

H784A 


0.22 


25% 


8% 



TABLE 5: Helix-hairpin-helix random mutagenesis 
DNA activity table 

IdT %Tth %Taq4M HP X 



TaqSS 
K198N 


23.4 


73% 


61% 


25.7 


1233.1 


TaqSS 
A205Q 


25.6 


80% 


67% 


13.4 


699.1 


TaqSS 
T204P 


11.2 


35% 


29% 


1.9 


209.4 


TaqSS 

I200M/A205G 


16.8 


53% 


44% 


7.8 


597.2 


TaqSS 
K203N 


25.9 


81% 


68% 


36.6 


1429.8 


TthDNRXHT 
H786A/P197R/K200R 


10.7 


33% 


28% 


3.2 


66.3 


TthDNRXHT 
H786A/K205Y 


11.5 


36% 


30% 


6.1 


327.5 


TthDNRXHT 
H786A/G203R 


18.3 


57% 


48% 


2.1 


98.8 



RNA activity table 



IrTl %Tth %Taq4M 



TaqSS 
K198N 


1.22 


137% 


46% 


TaqSS 
A205Q 


0.62 


70% 


23% 


TaqSS 
T204P 


0.36 


40% 


14% 


TaqSS 

I200M/A205G 


0.77 


87% 


29% 


TaqSS 
K203N 


2.09 


235% 


79% 


TthDNRXHT 
H786A/P197R/K200R 


0.47 


52% 


18% 


TthDNRXHT 
H786A/K205Y 


0.68 


77% 


26% 


TthDNRXHT 
H786A/G203R 


1.61 


180% 


61% 
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TABLE 6: Random thumb mutations 
DNA activity table 

IdT %Tth %Taq4M HP X 



TaqDNRXHT 
W417L/G418K/ 
E507K/H784A/G499R/ 
A502K/K504N/(M1-13) 


59.96 


188% 


157% 


133.65 


907.41 


TaqDNRXHT 
W417L/G418K/ 
/H784A/L500I/Q507H 
A502K/G504K (Ml-36) 


46.74 


146% 


122% 


123.11 


822.61 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784A/T514S (M2-24) 


85.7 


269% 


224% 


369.96 


3752.12 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784A/V518L(M2-06) 


76.7 


240% 


201% 


355.87 


2038.19 



RNA activity table 

IrTl %Tth %Taq4M 



TaqDNRXHT 
W417L/G418K/ 
E507K/H784A/G499R/ 
A502K/K504N/ (Ml-13) 


2.55 


287% 


96% 


TaqDNRXHT 
W417L/G418K/ 
/H784A/L500I/Q507H 
A502K/G504K (Ml-36) 


2.71 


304% 


102% 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784A/T514S (M2-24) 


4.43 


498% 


167% 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784A/V518L (M2-06) 


3.56 


400% 


134% 



100 



TABLE 7: Chimeric mutants 



A. DNA activity table 





IdT2 


%TthAKK 


HP 


A 


TthAKK 


34.18 


100% 


5 


393 


Taq 4M G504K 


/in in 






1991 


Tfi DN 2M 


36.60 


106% 


289 


1326 


Tsc DN 2M 


25.49 


75% 


283 


2573 


TaqTthAKK 


63.89 


187% 


32 


1658 


TthTaq 4M G504K 


25.03 


73% 


8 


627 


TfiTthAKK 


34.13 


100% 


15 


459 


TscTthAKK 


35.23 


103% 


29 


2703 


TfiTaq 4M G504K 


35.69 


104% 


37 


872 


TscTaq 4M 
G504K 


30.04 


88% 


25 


2008 



B. RNA activity table 

IrT3 %TthAKK 



TthAKK 


2.27 


100% 


Taq 4M G504K 


2.31 


102% 


Tfi DN 2M 


0.20 


9% 


Tsc DN 2M 


0.29 


13% 


TaqTthAKK 


6.81 


300% 


TthTaq 4M 
G504K 


1.09 


48% 


TfiTthAKK 


1.24 


55% 


TscTthAKK 


9.65 


4.25% 


TfiTaq 4M G504K 


1.05 


46% 


TscTaq 4M 
G504K 


2.95 


130% 
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All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
the described modes for carrying out the invention which are obvious to those skilled in the 
relevant fields are intended to be within the scope of the following claims. 



102 



SEQUENCE LISTING 

<110> Ma, Wu-Po 

Lyamichev, Victor I. 
Kaiser, Michael W. 
Lyamicheva, Natalie E. 
Allawi, Hatim T. 
Schaefer, James J. 
Neri, Bruce P. 

<12 0> Improved Enzymes for the Detection of Specific Nucleic Acid Sequences 
<130> FORS-04323 

<140> not yet assigned 
<141> 2000-05-24 

<160> 278 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2506 
<212> DNA 

<213> Thermus aquaticus 
<400> 1 

atgaggggga tgctgcccct ctttgagccc aagggccggg tcctcctggt ggacggccac 6 0 
cacctggcct accgcacctt ccacgccctg aagggcctca ccaccagccg gggggagccg 12 0 
gtgcaggcgg tctacggctt cgccaagagc ctcctcaagg ccctcaagga ggacggggac 180 
gcggtgatcg tggtctttga cgccaaggcc ccctccttcc gccacgaggc ctacgggggg 240 
tacaaggcgg gccgggcccc cacgccggag gactttcccc ggcaactcgc cctcatcaag 300 
gagctggtgg acctcctggg gctggcgcgc ctcgaggtcc cgggctacga ggcggacgac 3 60 
gtcctggcca gcctggccaa gaaggcggaa aaggagggct acgaggtccg catcctcacc 42 0 
gccgacaaag acctttacca gctcctttcc gaccgcatcc acgtcctcca ccccgagggg 480 
tacctcatca ccccggcctg gctttgggaa aagtacggcc tgaggcccga ccagtgggcc 54 0 
gactaccggg ccctgaccgg ggacgagtcc gacaaccttc ccggggtcaa gggcatcggg 600 
gagaagacgg cgaggaagct tctggaggag tgggggagcc tggaagccct cctcaagaac 660 
ctggaccggc tgaagcccgc catccgggag aagatcctgg cccacatgga cgatctgaag 720 
ctctcctggg acctggccaa ggtgcgcacc gacctgcccc tggaggtgga cttcgccaaa 7 80 
aggcgggagc ccgaccggga gaggcttagg gcctttctgg agaggcttga gtttggcagc 840 
ctcctccacg agttcggcct tctggaaagc cccaaggccc tggaggaggc cccctggccc 900 
ccgccggaag gggccttcgt gggctttgtg ctttcccgca aggagcccat gtgggccgat 960 
cttctggccc tggccgccgc cagggggggc cgggtccacc gggcccccga gccttataaa 1020 
gccctcaggg acctgaagga ggcgcggggg cttctcgcca aagacctgag cgttctggcc 1080 
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ctgagggaag gccttggcct cccgcccggc 
gacccttcca acaccacccc cgagggggtg 
gaggcggggg agcgggccgc cctttccgag 
9 a 99999 a 99 agaggctcct ttggctttac 
ctggcccaca tggaggccac gggggtgcgc 
ctggaggtgg ccgaggagat cgcccgcctc 
cccttcaacc tcaactcccg ggaccagctg 
cccgccatcg gcaagacgga gaagaccggc 
gccctccgcg aggcccaccc catcgtggag 
ctgaagagca cctacattga ccccttgccg 
cacacccgct tcaaccagac ggccacggcc 
ctccagaaca tccccgtccg caccccgctt 
gaggaggggt ggctattggt ggccctggac 
cacctctccg gcgacgagaa cctgatccgg 
gagaccgcca gctggatgtt cggcgtcccc 
gcggccaaga ccatcaactt cggggtcctc 
gagctagcca tcccttacga ggaggcccag 
cccaaggtgc gggcctggat tgagaagacc 
gagaccctct tcggccgccg ccgctacgtg 
cgggaggcgg ccgagcgcat ggccttcaac 
atgaagctgg ctatggtgaa gctcttcccc 
cttcaggtcc acgacgagct ggtcctcgag 
cggctggcca aggaggtcat ggagggggtg 
gtggggatag gggaggactg gctctccgcc 

<210> 2 
<211> 2496 
<212> DNA 

<213> thermus flavus 
<400> 2 

atggcgatgc ttcccctctt tgagcccaaa 
ctggcctacc gcaccttctt tgccctcaag 
caggcggtct acggcttcgc caaaagcctc 
gtggtggtgg tctttgacgc caaggccccc 



gacgacccca tgctcctcgc ctacctcctg 1140 
gcccggcgct acggcgggga gtggacggag 1200 
aggctcttcg ccaacctgtg ggggaggctt 1260 
C999 a 99tgg agaggcccct ttccgctgtc 1320 
ctggacgtgg cctatctcag ggccttgtcc 1380 
gaggccgagg tcttccgcct ggccggccac 1440 
gaaagggtcc tctttgacga gctagggctt 150 0 
aagcgctcca ccagcgccgc cgtcctggag 1560 
aagatcctgc agtaccggga gctcaccaag 162 0 
gacctcatcc accccaggac gggccgcctc 1680 
ac 999 ca 99 c taagtagctc cgatcccaac 1740 
gggcagagga tccgccgggc cttcatcgcc 1800 
tatagccaga tagagctcag ggtgctggcc 1860 
gtcttccagg aggggcggga catccacacg 192 0 
cgggaggccg tggaccccct gatgcgccgg 1980 
tacggcatgt cggcccaccg cctctcccag 2 040 
gccttcattg agcgctactt tcagagcttc 2100 
ctggaggagg gcaggaggcg ggggtacgtg 2160 
ccagacctag aggcccgggt gaagagcgtg 222 0 
atgcccgtcc agggcaccgc cgccgacctc 22 80 
aggctggagg aaatgggggc caggatgctc 2 34 0 
gccccaaaag agagggcgga ggccgtggcc 24 00 
tatcccctgg ccgtgcccct ggaggtggag 2460 
aaggagtgat accacc 250 6 



ggccgcgtgc tcctggtgga cggccaccac 60 
ggcct caeca ccagccgcgg cgaacccgtt 12 0 
ctcaaggccc tgaaggagga eggggaegtg 180 
tccttccgcc aegaggecta cgaggcctac 240 
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aaggcgggcc gggcccccac cccggaggac tttccccggc agctggccct catcaaggag 3 00 
ttggtggacc tcctaggcct tgtgcggctg gaggttcccg gctttgaggc ggacgacgtg 3 60 
ctggccaccc tggccaagcg ggcggaaaag gaggggtacg aggtgcgcat cctcactgcc 420 
gaccgcgacc tctaccagct cctttcggag cgcatcgcca tcctccaccc tgaggggtac 480 
ctgatcaccc cggcgtggct ttacgagaag tacggcctgc gcccggagca gtgggtggac 540 
taccgggccc tggcggggga cccctcggat aacatccccg gggtgaaggg catcggggag 60 0 
aagaccgccc agaggctcat ccgcgagtgg gggagcctgg aaaacctctt ccagcacctg 660 
gaccaggtga agccctcctt gcgggagaag ctccaggcgg gcatggaggc cctggccctt 720 
tcccggaagc tttcccaggt gcacactgac ctgcccctgg aggtggactt cgggaggcgc 780 
cgcacaccca acctggaggg tctgcgggct tttttggagc ggttggagtt tggaagcctc 84 0 
ctccacgagt tcggcctcct ggaggggccg aaggcggcag aggaggcccc ctggccccct 90 0 
ccggaagggg cttttttggg cttttccttt tcccgtcccg agcccatgtg ggccgagctt 960 
ctggccctgg ctggggcgtg ggaggggcgc ctccatcggg cacaagaccc ccttaggggc 1020 
ctgagggacc ttaagggggt gcggggaatc ctggccaagg acctggcggt tttggccctg 1080 
cgggagggcc tggacctctt cccagaggac gaccccatgc tcctggccta ccttctggac 1140 
ccctccaaca ccacccctga gggggtggcc cggcgttacg ggggggagtg gacggaggat 12 0 0 
gcgggggaga gggccctcct ggccgagcgc ctcttccaga ccctaaagga gcgccttaag 1260 
ggagaagaac gcctgctttg gctttacgag gaggtggaga agccgctttc ccgggtgttg 132 0 
gcccggatgg aggccacggg ggtccggctg gacgtggcct acctccaggc cctctccctg 13 80 
gaggtggagg cggaggtgcg ccagctggag gaggaggtct tccgcctggc cggccacccc 144 0 
ttcaacctca actcccgcga ccagctggag cgggtgctct ttgacgagct gggcctgcct 15 00 
gccatcggca agacggagaa gacggggaaa cgctccacca gcgctgccgt gctggaggcc 1560 
ctgcgagagg cccaccccat cgtggaccgc atcctgcagt accgggagct caccaagctc 1620 
aagaacacct acatagaccc cctgcccgcc ctggtccacc ccaagaccgg ccggctccac 1680 
acccgcttca accagacggc caccgccacg ggcaggcttt ccagctccga ccccaacctg 1740 
cagaacatcc ccgtgcgcac ccctctgggc cagcgcatcc gccgagcctt cgtggccgag 1800 
gagggctggg tgctggtggt cttggactac agccagattg agcttcgggt cctggcccac 1860 
ctctccgggg acgagaacct gatccgggtc tttcaggagg ggagggacat ccacacccag 192 0 
accgccagct ggatgttcgg cgtttccccc gaaggggtag accctctgat gcgccgggcg 1980 
gccaagacca tcaacttcgg ggtgctctac ggcatgtccg cccaccgcct ctccggggag 2040 
ctttccatcc cctacgagga ggcggtggcc ttcattgagc gctacttcca gagctacccc 2100 
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aaggtgcggg cctggattga ggggaccctc 
accctcttcg gccgccggcg ctatgtgccc 
gaggcggcgg agcgcatggc cttcaacatg 
aagctggcca tggtgcggct tttcccccgg 
caggtgcacg acgagctggt cctcgaggcc 
ttggccaagg aggtcatgga gggggtctgg 
ggcctggggg aggactggct ctccgccaag 

<210> 3 
<211> 2511 
<212> DNA 

<213> Thermus thermophilus 
<400> 3 

atgaattccg aggcgatgct tccgctcttt 
ggccaccacc tggcctaccg caccttcttc 
gaaccggtgc aggcggtcta cggcttcgcc 
gggtacaagg ccgtcttcgt ggtctttgac 
tacgaggcct acaaggcggg gagggccccg 
ctcatcaagg agctggtgga cctcctgggg 
gcggacgacg ttctcgccac cctggccaag 
atcctcaccg ccgaccgcga cctctaccaa 
cccgagggcc acctcatcac cccggagtgg 
cagtgggtgg acttccgcgc cctcgtgggg 
ggcatcgggg agaagaccgc cctcaagctc 
ctcaagaacc tggaccgggt aaagccagaa 
gaagacctca ggctctcctt ggagctctcc 
gacctcgccc aggggcggga gcccgaccgg 
gagttcggca gcctcctcca cgagttcggc 
gccccctggc ccccgccgga aggggccttc 
atgtgggcgg agcttaaagc cctggccgcc 
gaccccttgg cggggctaaa ggacctcaag 
gccgtcttgg cctcgaggga ggggctagac 
gcctacctcc tggacccctc caacaccacc 
gagtggacgg aggacgccgc ccaccgggcc 



gaggagggcc gccggcgggg gtatgtggag 2160 
gacctcaacg cccgggtgaa gagcgtgcgc 222 0 
ccggtccagg gcaccgccgc cgacctcatg 2280 
cttcaggaac tgggggcgag gatgcttttg 2340 
cccaaggacc gggcggagag ggtagccgct 240 0 
cccctgcagg tgcccctgga ggtggaggtg 2460 
gagtag 2496 



gaacccaaag gccgggtcct cctggtggac 60 
gccctgaagg gcctcaccac gagccggggc 12 0 
aagagcctcc tcaaggccct gaaggaggac 180 
gccaaggccc cctccttccg ccacgaggcc 24 0 
acccccgagg acttcccccg gcagctcgcc 300 
tttacccgcc tcgaggtccc cggctacgag 360 
aaggcggaaa aggaggggta cgaggtgcgc 42 0 
ctcgtctccg accgcgtcgc cgtcctccac 480 
ctttgggaga agtacggcct caggccggag 540 
gacccctccg acaacctccc cggggtcaag 60 0 
ctcaaggagt ggggaagcct ggaaaacctc 660 
aacgtccggg agaagatcaa ggcccacctg 72 0 
cgggtgcgca ccgacctccc cctggaggtg 780 
gaggggctta gggccttcct ggagaggctg 84 0 
ctcctggagg cccccgcccc cctggaggag 90 0 
gtgggcttcg tcctctcccg ccccgagccc 960 
tgcagggacg gccgggtgca ccgggcagca 102 0 
gaggtccggg gcctcctcgc caaggacctc 10 80 
ctcgtgcccg gggacgaccc catgctcctc 1140 
cccgaggggg tggcgcggcg ctacgggggg 12 00 
ctcctctcgg agaggctcca tcggaacctc 1260 
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cttaagcgcc 


tcgaggggga 


gg&gaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


catggaggcc 


^ccggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacgacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtta 


g 


2511 



<210> 4 
<211> 832 
<212> PRT 

<213> Thermus aquaticus 
<400> 4 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie Val 
50 55 60 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly 
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65 70 75 80 

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 95 

Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu 
100 105 HO 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys 
115 120 125 

Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp 
130 135 140 

Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly 
145 150 155 160 

Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn 
180 185 190 

Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu 
195 200 205 

Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu 
210 215 220 

Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys 
225 230 235 240 

Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 

Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala Asp 
305 310 315 320 

Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Pro 
325 330 335 

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 
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Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu 
405 410 415 

Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu 
420 425 430 

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
725 730 735 
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Val Lys Ser Val Arg Glu Ala Ala 
740 

Val Gin Gly Thr Ala Ala Asp Leu 
755 760 

Phe Pro Arg Leu Glu Glu Met Gly 
770 775 

Asp Glu Leu Val Leu Glu Ala Pro 
785 790 

Arg Leu Ala Lys Glu Val Met Glu 
805 

Leu Glu Val Glu Val Gly He Gly 
820 



Glu Arg Met Ala Phe Asn Met Pro 
745 750 

Met Lys Leu Ala Met Val Lys Leu 
765 

Ala Arg Met Leu Leu Gin Val His 
780 

Lys Glu Arg Ala Glu Ala Val Ala 
795 800 

Gly Val Tyr Pro Leu Ala Val Pro 
810 815 

Glu Asp Trp Leu Ser Ala Lys Glu 
825 830 



<210> 5 
<211> 831 
<212> PRT 

<213> thermus flavus 
<400> 5 

Met Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Val Val Val Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Arg Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ala He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Tyr Glu Lys Tyr Gly Leu Arg Pro Glu 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
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180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Gin Arg Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Phe Gin His Leu Asp Gin Val Lys 
210 215 220 

Pro Ser Leu Arg Glu Lys Leu Gin Ala Gly Met Glu Ala Leu Ala Leu 
225 230 235 240 

Ser Arg Lys Leu Ser Gin Val His Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Gly Arg Arg Arg Thr Pro Asn Leu Glu Gly Leu Arg Ala Phe Leu 
260 265 270 

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu 
275 280 285 

Gly Pro Lys Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala 
290 295 300 

Phe Leu Gly Phe Ser Phe Ser Arg Pro Glu Pro Met Trp Ala Glu Leu 
305 310 315 320 

Leu Ala Leu Ala Gly Ala Trp Glu Gly Arg Leu His Arg Ala Gin Asp 
325 330 335 

Pro Leu Arg Gly Leu Arg Asp Leu Lys Gly Val Arg Gly He Leu Ala 
340 345 350 

Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Phe Pro 
355 360 365 

Glu Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr 
370 375 380 

Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp 
385 390 395 400 

Ala Gly Glu Arg Ala Leu Leu Ala Glu Arg Leu Phe Gin Thr Leu Lys 
405 410 415 

Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu Val 
420 425 430 

Glu Lys Pro Leu Ser Arg Val Leu Ala Arg Met Glu Ala Thr Gly Val 
435 440 445 

Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Glu Ala 
450 455 460 

Glu Val Arg Gin Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 480 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 
485 490 495 

Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser 
500 505 510 
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Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val 
515 520 525 

Asp Arg lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr 
530 535 540 

lie Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Gly Arg Leu His 
545 550 555 560 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 
565 570 575 

Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 590 

lie Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Val Leu Val Val Leu 
595 600 605 

Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp 
610 615 620 

Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr Gin 
625 630 635 640 

Thr Ala Ser Trp Met Phe Gly Val Ser Pro Glu Gly Val Asp Pro Leu 
645 650 655 

Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met 
660 665 670 

Ser Ala His Arg Leu Ser Gly Glu Leu Ser He Pro Tyr Glu Glu Ala 
675 680 685 

Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg Ala 
690 695 700 

Trp He Glu Gly Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu 
705 710 715 720 

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val 
725 730 735 

Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val 
740 745 750 

Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Arg Leu Phe 
755 760 765 

Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His Asp 
770 775 780 

Glu Leu Val Leu Glu Ala Pro Lys Asp Arg Ala Glu Arg Val Ala Ala 
785 790 795 800 

Leu Ala Lys Glu Val Met Glu Gly Val Trp Pro Leu Gin Val Pro Leu 
805 810 815 

Glu Val Glu Val Gly Leu Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 



<210> 6 
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<211> 836 
<212> PRT 

<213> Thermus thermophilus 
<400> 6 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 
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Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 
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Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 7 
<211> 2502 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 7 

atgnnggcga tgcttcccct ctttgagccc aaaggccggg tcctcctggt ggacggccac 60 
cacctggcct accgcacctt cttcgccctg aagggcctca ccaccagccg gggcgaaccg 12 0 
gtgcaggcgg tctacggctt cgccaagagc ctcctcaagg ccctgaagga ggacggggac 180 
nnggcggtgn tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 24 0 
gcctacaagg cgggccgggc ccccaccccg gaggactttc cccggcagct cgccctcatc 3 00 
aaggagctgg tggacctcct ggggcttgcg cgcctcgagg tccccggcta cgaggcggac 360 
gacgtnctgg ccaccctggc caagaaggcg gaaaaggagg ggtacgaggt gcgcatcctc 420 
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accgccgacc gcgacctcta ccagctcctt 
gggtacctca tcaccccggc gtggctttgg 
gtggactacc gggccctggc gggggacccc 
ggggagaaga ccgcccngaa gctcctcnag 
aacctggacc gggtgaagcc cgccntccgg 
angctctcct gggagctntc ccaggtgcgc 
aagnggcggg agcccgaccg ggaggggctt 
agcctcctcc acgagttcgg cctcctggag 
cccccgccgg aaggggcctt cgtgggcttt 
gagcttctgg ccctggccgc cgccagggag 
angggcctna gggacctnaa ggaggtgcgg 
gccctgaggg agggcctnga cctcntgccc 
ctggacccct ccaacaccac ccccgagggg 
gaggangcgg gggagcgggc cctcctntcc 
cttgaggggg aggagaggct cctttggctt 
gtcctggccc acatggaggc cacgggggtn 
tccctggagg tggcggagga gatccgccgc 
caccccttca acctcaactc ccgggaccag 
cttcccgcca tcggcaagac ggagaagacn 
gaggccctnc gngaggccca ccccatcgtg 
aagctcaaga acacctacat ngaccccctg 
ctccacaccc gcttcaacca gacggccacg 
aacctgcaga acatccccgt ccgcaccccn 
gccgaggagg gntgggtgtt ggtggccctg 
gcccacctct ccggggacga gaacctgatc 
acccagaccg ccagctggat gttcggcgtc 
cgggcggcca agaccatcaa cttcggggtc 
caggagcttg ccatccccta cgaggaggcg 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccggcgctac 
gtgcgggagg cggcggagcg catggccttc 



tccgaccgca tcgccgtcct ccaccccgag 480 
gagaagtacg gcctgaggcc ggagcagtgg 540 
tccgacaacc tccccggggt caagggcatc 600 
gagtggggga gcctggaaaa cctcctcaag 660 
gagaagatcc aggcccacat ggangacctg 720 
accgacctgc ccctggaggt ggacttcgcc 780 
agggcctttc tggagaggct ggagtttggc 840 
ggccccaagg ccctggagga ggccccctgg 90 0 
gtcctttccc gccccgagcc catgtgggcc 960 
ggccgggtcc accgggcacc agaccccttt 1020 
ggnctcctcg ccaaggacct ggccgttttg 1080 
ggggacgacc ccatgctcct cgcctacctc 1140 
gtggcccggc gctacggggg ggagtggacg 12 00 
gagaggctct tccngaacct nnngcagcgc 1260 
taccaggagg tggagaagcc cctttcccgg 132 0 
cggctggacg tggcctacct ccaggccctn 1380 
ctcgaggagg aggtcttccg cctggccggc 1440 
ctggaaaggg tgctctttga cgagctnggg 1500 
ggcaagcgct ccaccagcgc cgccgtgctg 1560 
gagaagatcc tgcagtaccg ggagctcacc 162 0 
ccngncctcg tccaccccag gacgggccgc 168 0 
gccacgggca ggcttagtag ctccgacccc 1740 
ctgggccaga ggatccgccg ggccttcgtg 1800 
gactatagcc agatagagct ccgggtcctg 1860 
cgggtcttcc aggaggggag ggacatccac 192 0 
cccccggagg ccgtggaccc cctgatgcgc 1980 
ctctacggca tgtccgccca ccgcctctcc 2040 
gtggccttca ttgagcgcta cttccagagc 2100 
accctggagg agggcaggag gcgggggtac 2160 
gtgcccgacc tcaacgcccg ggtgaagagc 222 0 
aacatgcccg tccagggcac cgccgccgac 2280 
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ctcatgaagc tggccatggt gaagctcttc ccccggctnc aggaaatggg ggccaggatg 2 340 
ctcctncagg tccacgacga gctggtcctc gaggccccca aagagcgggc ggaggnggtg 240 0 
gccgctttgg ccaaggaggt catggagggg gtctatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga tgggggagga ctggctctcc gccaaggagt ag 2502 

<210> 8 
<211> 833 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 8 

Met Xaa Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Xaa Val 
50 55 60 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala 
65 70 75 80 

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 95 

Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Xaa Arg Leu Glu 
100 105 110 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys 
115 120 125 

Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp 
130 135 140 

Leu Tyr Gin Leu Leu Ser Asp Arg lie Ala Val Leu His Pro Glu Gly 
145 150 155 160 

Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

Glu Gin Trp Val Asp Tyr Arg Ala Leu Xaa Gly Asp Pro Ser Asp Asn 
180 185 190 

Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Xaa Lys Leu Leu 
195 200 205 

Xaa Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val 
210 215 220 

Lys Pro Xaa Xaa Arg Glu Lys lie Xaa Ala His Met Glu Asp Leu Xaa 
225 230 235 240 
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Leu Ser Xaa Xaa Leu Ser Xaa Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 

Asp Phe Ala Xaa Arg Arg Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Xaa Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Leu Ala Leu Ala Ala Ala Arg Xaa Gly Arg Val His Arg Ala Xaa 
325 330 335 

Asp Pro Leu Xaa Gly Leu Arg Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Xaa 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Phe Xaa Asn Leu 
405 410 415 

Xaa Xaa Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Xaa Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 

Tyr lie Asp Pro Leu Pro Xaa Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 
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Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Xaa Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Xaa Glu Met Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asp Glu Leu Val Leu Glu Ala Pro Lys Xaa Arg Ala Glu Xaa Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Xaa Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

Xaa 



<210> 9 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 9 

uucgcuuucu ucccuuccuu ucucgccacg uucgccggcu uuccccguca age 
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<210> 10 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 10 

acggggaaag ccggcgaacg tggcgagaaa 



<210> 11 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 11 

attagaaagg aagggaagaa agcgaa 



<210> 12 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 12 

acggggaaag ccggcgaacg tggcgagaac 



<210> 13 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 13 

cttgacgggg aaagccggcg aacgtggcgc 



<210> 14 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 14 

agaaaggaag ggaagaaagc gaa 



<210> 15 
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<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 15 

agggagaagg caactggacc gaaggcc 2 7 



<210> 16 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> (1) 

<223> This 5' end has a fluorescein label. 
<220> 

<221> misc_f eature 
<222> (35) 

<223> This 3' end is modified with a dideoxynucleotide . 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 16 

ncgaaattaa tacgcttgtg gagaaggagt tcatn 3 5 



<210> 17 

<211> 640 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 17 
gggagcccag 


cuaugaacuc 


cuucuccaca 


agcgccuucg 


guccaguugc 


cuucucccug 


60 


gggcugcucc 


ugguguugcc 


ugcugccuuc 


ccugccccag 


uacccccagg 


agaagauucc 


120 


aaagauguag 


ccgccccaca 


cagacagcca 


cucaccucuu 


cagaacgaau 


ugacaaacaa 


180 


auucgguaca 


uccucgacgg 


caucucagcc 


cugagaaagg 


agacauguaa 


caagaguaac 


240 


augugugaaa 


gcagcaaaga 


ggcacuggca 


gaaaacaacc 


ugaaccuucc 


aaagauggcu 


300 


gaaaaagaug 


gaugcuucca 


aucuggauuc 


aaugaggaga 


cuugccuggu 


gaaaaucauc 


360 


acuggucuuu 


uggaguuuga 


gguauaccua 


gaguaccucc 


agaacagauu 


ugagaguagu 


420 


gaggaacaag 


ccagagcugu 


ccagaugagu 


acaaaagucc 


ugauccaguu 


ccugcagaaa 


480 


aaggcaaaga 


aucuagaugc 


aauaaccacc 


ccugacccaa 


ccacaaaugc 


cagccugcug 


540 


acgaagcugc 


aggcacagaa 


ccaguggcug 


caggacauga 


caacucaucu 


cauucugcgc 


600 
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agcuuuaagg aguuccugca guccagccug agggcucuuc 



640 



<210> 18 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 18 

gctatgaact ccttctccac aagcgccttc ggtccagttg ccttctccct ggg 53 



<210> 19 
<211> 214 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 19 

Asp Pro Ser Asn Thr 
1 5 

Glu Trp Thr Glu Glu 
20 

Phe Ala Asn Leu Trp 
35 

Leu Tyr Arg Glu Val 
50 

Glu Ala Thr Gly Val 
65 

Leu Glu Val Ala Glu 
85 

Leu Ala Gly His Pro 
100 

Val Leu Phe Asp Glu 
115 

Thr Gly Lys Arg Ser 
130 

Ala His Pro He Val 
145 

Leu Lys Ser Thr Tyr 
165 

Thr Gly Arg Leu His 
180 

Arg Leu Ser Ser Ser 
195 



Thr Pro Glu Gly 



Ala Gly Glu Arg 
25 

Gly Arg Leu Glu 
40 

Glu Arg Pro Leu 
55 



Val Ala Arg 
10 

Ala Ala Leu 



Gly Glu Glu 



Ser Ala Val 
60 



Arg Tyr Gly Gly 
15 

Ser Glu Arg Leu 
30 

Arg Leu Leu Trp 
45 

Leu Ala His Met 



Arg Leu Asp Val 
70 



Ala Tyr Leu 
75 



Arg Ala Leu Ser 
80 



Glu He Ala Arg 



Leu Glu Ala 
90 



Glu Val Phe Arg 
95 



Phe Asn Leu Asn 
105 

Leu Gly Leu Pro 
120 



Ser Arg Asp 
Ala He Gly 



Gin Leu Glu Arg 
110 

Lys Thr Glu Lys 
125 



Thr Ser Ala Ala 
135 



Val Leu Glu 
140 



Ala Leu Arg Glu 



Glu Lys He Leu 
150 

He Asp Pro Leu 



Thr Arg Phe Asn 
185 



Gin Tyr Arg 
155 

Pro Asp Leu 
170 

Gin Thr Ala 



Glu Leu Thr Lys 
160 

He His Pro Arg 
175 

Thr Ala Thr Gly 
190 



Asp Pro Asn Leu 
200 



Gin Asn He 



Pro Val Arg Thr 
205 



122 



Pro Leu Gly Gin Arg lie 
210 



<210> 20 
<211> 214 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 20 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
15 10 15 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
20 25 30 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
35 40 45 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
50 55 60 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
65 70 75 80 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
85 90 95 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
100 105 110 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
115 120 125 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
130 135 140 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
145 150 155 160 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
165 170 175 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
180 185 190 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
195 200 205 

Pro Leu Gly Gin Arg lie 
210 



<210> 21 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 



Sequence 
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<221> mi sc_f eature 
<222> (1) 

<223> This 5' end has a fluorescein label. 
<220> 

<221> misc_f eature 
<222> (6) 

<223> The residue at this position is a cy3 abasic 
linker group. 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 21 

ncgctntctc gctcgc 16 



<210> 22 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 22 

acggaacgag cgtctttg 18 



<210> 23 

<211> 32 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 23 

gcgagcgaga cagcgaaaga cgcucguucc gu 32 



<210> 24 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 24 

gcgagcgaga cagcgaaaga cgctcgttcc gt 32 



<210> 25 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 25 
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acggaacgag cgtctttcat ctgtcaatc 



29 



<210> 26 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> (1) 

<223> This 5' end is labeled with 
tetrachlororf luorescein , 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 26 

nttttcaact gccgtga 17 



<210> 27 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> (1) 

<223> This 5' end is modified with a biotin-streptavitin 
complex . 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 27 

nucacggcag uuggugcgcc ucggaacgag gcgcacg 37 



<210> 28 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 28 

tcacggcagt tggtgcgcct cggaacgagg cgcacg 3 6 



<210> 29 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> (30) 

<223> This 3' end is modified with an amine moiety. 
<220> 
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<223> Description of Artificial Sequence: Synthetic 
<400> 29 

cggaggaagc agttggtgcg cctcgttaan 3 0 



<210> 30 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> (1) 

<223> This 5' end is labeled with fluorescein. 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 30 

ntccttctca actgcttcct ccg 23 



<210> 31 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> mi sc_f eature 
<222> (28) 

<223> This 3' end is modified with a biotin moiety. 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 31 

aacgaggcgc acctcaaatc tccctttn 2 8 



<210> 32 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> (1) 

<223> This 5' end is labeled with fluorescein. 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 32 

nttttcgctg tctcgct 17 



<210> 33 

<211> 13 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 33 
acgagcgtct ttg 



<210> 34 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> (1) 

<223> This 5' end is labeled with fluorescein. 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 34 

nagcgagaca gcgaaagacg ctcgtt 



<210> 35 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 35 

ucacggcagu uggugcggaa cgcacg 



<210> 36 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 36 

tcacggcagt tggtgcggaa cgcacg 



<210> 37 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 37 

cacgaattcg gggatgctgc ccctctttga gcccaa 



<210> 38 
<211> 34 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 38 

gtgagatcta tcactccttg gcggagagcc agtc 34 

<210> 39 
<211> 2502 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 39 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 4 80 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 90 0 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggg ggccgggtcc accgggcccc cgagccttat 1020 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 108 0 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt ccaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 12 60 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 
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gtcctggccc 


acatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgggga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccggggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacgacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagt 


ga 




2502 



<210> 40 
<211> 833 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 40 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 
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Val Val Phe Asp Ala Lys Ala 
65 70 



Pro Ser Phe Arg His Glu Ala Tyr Gly 
75 80 



Gly Tyr Lys Ala Gly Arg Ala 
85 



Pro Thr Pro Glu Asp Phe Pro Arg Gin 
90 95 



Leu Ala Leu lie Lys Glu Leu 
100 



Val Asp Leu Leu Gly Leu Ala Arg Leu 
105 110 



Glu Val Pro Gly Tyr Glu Ala 
115 



Asp Asp Val Leu Ala Ser Leu Ala Lys 
120 125 



Lys Ala Glu Lys Glu Gly Tyr 
130 135 



Glu Val Arg lie Leu Thr Ala Asp Lys 
140 



Asp Leu Tyr Gin Leu Leu Ser 
145 150 



Asp Arg lie His Val Leu His Pro Glu 
155 160 



Gly Tyr Leu lie Thr Pro Ala 
165 



Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
170 175 



Pro Asp Gin Trp Ala Asp Tyr 
180 



Arg Ala Leu Thr Gly Asp Glu Ser Asp 
185 190 



Asn Leu Pro Gly Val Lys Gly 
195 



lie Gly Glu Lys Thr Ala Arg Lys Leu 
200 205 



Leu Glu Glu Trp Gly Ser Leu 
210 215 



Glu Ala Leu Leu Lys Asn Leu Asp Arg 
220 



Leu Lys Pro Ala lie Arg Glu 
225 230 



Lys lie Leu Ala His Met Asp Asp Leu 
235 240 



Lys Leu Ser Trp Asp Leu Ala 
245 



Lys Val Arg Thr Asp Leu Pro Leu Glu 
250 255 



Val Asp Phe Ala Lys Arg Arg 
260 



Glu Pro Asp Arg Glu Arg Leu Arg Ala 
265 270 



Phe Leu Glu Arg Leu Glu Phe 
275 



Gly Ser Leu Leu His Glu Phe Gly Leu 
280 285 



Leu Glu Ser Pro Lys Ala Leu 
290 295 



Glu Glu Ala Pro Trp Pro Pro Pro Glu 
300 



Gly Ala Phe Val Gly Phe Val 
305 310 



Leu Ser Arg Lys Glu Pro Met Trp Ala 
315 320 



Asp Leu Leu Ala Leu Ala Ala 
325 



Ala Arg Gly Gly Arg Val His Arg Ala 
330 335 



Pro Glu Pro Tyr Lys Ala Leu 
340 



Arg Asp Leu Lys Glu Ala Arg Gly Leu 
345 350 



Leu Ala Lys Asp Leu Ser Val 
355 



Leu Ala Leu Arg Glu Gly Leu Gly Leu 
360 365 



Pro Pro Gly Asp Asp Pro Met 
370 375 



Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
380 



Asn Thr Thr Pro Glu Gly Val 
385 390 



Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
395 400 



130 



Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 
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Arg Val Lys Ser Val Arg Glu Ala 
740 

Pro Val Arg Gly Thr Ala Ala Asp 
755 760 

Leu Phe Pro Arg Leu Glu Glu Met 
770 775 

His Asp Glu Leu Val Leu Glu Ala 
785 790 

Ala Arg Leu Ala Lys Glu Val Met 
805 

Pro Leu Glu Val Glu Val Gly lie 
820 

Glu 



Ala Glu Arg Met Ala Phe Asn Met 
745 750 

Leu Met Lys Leu Ala Met Val Lys 
765 

Gly Ala Arg Met Leu Leu Gin Val 
780 

Pro Lys Glu Arg Ala Glu Ala Val 
795 800 

Glu Gly Val Tyr Pro Leu Ala Val 
810 815 

Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



<210> 41 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 41 

cacgaattcc gaggcgatgc ttccgctc 2 8 



<210> 42 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 42 

tcgacgtcga ctaacccttg gcggaaagcc 



<210> 43 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 43 

gcatcgcctc ggaattcatg gtc 23 



<210> 44 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 44 

atagccatgg tggagcggcc gctctcccgg 



<210> 45 

<211> 33 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 45 

aagcgtcgac tcaatcctgc ttcgcctcca gcc 



<210> 46 

<211> 32 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 46 

aatcgaattc accccacttt ttgacctgga gg 



<210> 47 

<211> 21 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 47 

ccgggagagc ggccgctcca c 



<210> 48 
<211> 2508 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 48 

atggaattca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 
ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 12 0 
ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 
caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 24 0 
gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 3 00 
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aagcggctgg 


tggacc t tct 


gggcctggtc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


360 


ya.cgu.ccL.gg 


gcaccctggc 


caagaaggcc 


gaaagggagg 


ggatggaggt 


gcgcatcctc 


420 


acgggagac c 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


480 


gggac c c t gg 


tcaccccaaa 


ggacgtccag 


gagaagtacg 


gggtgccccc 


ggagcgctgg 


540 


gtggacttcc 


gcgccctcac 


gggggaccgc 


tcggacaaca 


tccccggggt 


ggcggggata 


600 


ggggagaaga 


ccgcccttcg 


actcctcgca 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


660 


aacc tggacc 


gggtaaagcc 


ggactcgctc 


cggcgcaaga 


tagaggcgca 


cctcgaggac 


720 


cuccacc cct 


ccttagacct 


ggcccgcatc 


cgcaccgacc 


tccccctgga 


ggtggacttt 


780 


aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt 


ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcggagg 


gccgggtcca 


ccgggcaaca 


1020 


agcccggttg 


&ggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc 


caaggacctg 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca 


cggacgaccc 


cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg 


ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt 


ccagaacctc 


1260 


tttccccggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg 


gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct 


tctggaggcc 


1380 


cfcctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt 


ccgtttggcc 


1440 


ggccaccccu 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt 


tgacgagctg 


1500 


ggcctcaccc 


cggtgggccg 


gacggagaag 


acgggcaagc 


gctccaccgc 


ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta 


ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc 


gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc 


cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 


agcgcatccg 


caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg 


gcggactact 


cccagattga 


gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg 


gaaggacatc 


1920 


cataccgaga 


ccgccgcctg 


gatgttcggc 


ttagaccccg 


ctctggtgga 


tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc 


ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag 


gcggaggcct 


ttattgagcg 


ctacttccag 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa 


aggaccctgg 


aggagggccg 


gacgcggggc 


2160 
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tacgtggaga ccctgttcgg caggaggcgc tatgtgcccg acctggcctc ccgggtccgc 222 0 
tcggtgcggg aggcggcgga gcggatggcc ttcaacatgc ccgtgcaggg caccgccgcc 22 80 
gacctgatga agatcgccat ggtcaagctc ttccccaggc taaagcccct gggggcccac 234 0 
ctcctcctcc aagtgcacga cgagctggtc ctggaggtgc ccgaggaccg ggccgaggag 24 00 
gccaaggccc tggtcaagga ggtcatggag aacgcctacc ccctggacgt gcccctcgag 2460 
gtggaggtgg gcgtgggtcg ggactggctg gaggcgaagc aggattga 2508 



<210> 49 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 49 

actggaattc ctgcccctct ttgagcccaa g 



<210> 50 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 50 

aacagtcgac ctaggccttg gcggaaagcc 



<210> 51 
<211> 2499 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 51 

atggaattcc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 6 0 
ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 12 0 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 
gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 240 
aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 3 00 
atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 360 
ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 420 
gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 4 80 



135 



ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 540 
taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 600 
aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 660 
gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 72 0 
ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 7 80 
cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 840 
ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 90 0 
ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 960 
cttaacgcct tggccgccgc ctgggaggga agggtttacc gggcggagga tcccttggag 1020 
gccttgcggg ggcttgggga ggtgaggggg cttttggcca aggacctggc ggtgctggcc 1080 
ctgagggaag ggattgccct ggcaccgggc gacgacccca tgctcctcgc ctacctcctg 1140 
gatccttcca acaccgcccc cgaaggggta gcccggcgct acggggggga gtggaccgag 12 00 
gaggcggggg aaagggcgct gctttccgaa aggctttacg ccgccctcct ggagcggctt 1260 
aagggggagg agaggcttct ttggctttac gaggaggtgg aaaagcccct ttcgcgggtc 132 0 
ctggcccaca tggaggccac gggggtacgg ttggatgtgg cctacttaaa ggccctttcc 1380 
ctggaggtgg aggcggagat aaggcgcttc gaggaggagg tccaccgcct ggccgggcat 1440 
cctttcaacc tgaactcccg ggaccagctg gaaagggtca tctttgacga gcttgggctt 1500 
cccgccatcg gcaagacgga gaagacgggc aagcgctcca ccagcgccgc cgttttggag 1560 
gccttgcggg aggctcatcc catcgtggac cgcatccttc agtaccggga gctttccaag 162 0 
ctcaagggaa cctacatcga tcccttgcct gccctggtcc accccaagac gaaccgcctc 1680 
cacacccgtt tcaaccagac ggccaccgcc acggggaggc ttagcagctc ggatcctaat 174 0 
ctgcaaaata tccccgtgcg cacccctttg ggccagcgga tccgccgggc cttcgtggcc 180 0 
gaggaggggt ggaggctggt ggttttggac tacagccaga ttgagctcag ggtcctggcg 186 0 
cacctttccg gggacgagaa cctaatccgg gtcttccagg agggccagga catccacacc 192 0 
cagacggcca gctggatgtt cggcgtgccc ccagaggccg tggattccct gatgcgccgg 1980 
gcggccaaga ccatcaactt cggcgtcctc tacggcatgt ccgcccaccg gctttcggga 2 040 
gagctggcca tcccctacga ggaggcggtg gccttcatcg agcggtattt ccagagctac 2100 
cccaaggtgc gggcctggat tgagaaaacc ctggcggaag gacgggaacg gggctatgtg 2160 
gaaaccctct ttggccgccg gcgctacgtg cccgacttgg cttcccgggt gaagagcatc 2220 
cgggaggcag cggagcgcat ggccttcaac atgccggtcc aggggaccgc cgcggatttg 2280 
atgaaactgg ccatggtgaa gctctttccc aggcttcagg agctgggggc caggatgctt 2340 
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ttgcaggtgc acgacgaact ggtcctcgag gctcccaagg agcaagcgga ggaagtcgcc 2400 
caggaggcca agcggaccat ggaggaggtg tggcccctga aggtgccctt ggaggtggaa 2460 
gtgggcatcg gggaggactg gctttccgcc aaggcctag 2499 



<210> 52 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 52 

cgatctcctc ggccacctcc 2 0 



<210> 53 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 53 

ggcggtgccc tggacgggca 2 0 



<210> 54 

<211> 20 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 54 

ccagctcgtt gtggacctga 2 0 



<210> 55 
<211> 2505 
<212> DNA 

<213> Thermus aquaticus 

<220> 

<221> CDS 

<222> (1) . . (2499) 

<400> 55 

atg aat teg ggg atg ctg ccc etc ttt gag ccc aag ggc egg gtc etc 48 
Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

ctg gtg gac ggc cac cac ctg gec tac cgc acc ttc cac gee ctg aag 96 
Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

ggc etc acc acc age egg ggg gag ccg gtg cag gcg gtc tac ggc ttc 144 
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Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

gcc aag age etc etc aag gee etc aag gag gac ggg gac gcg gtg ate 192 
Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 



gtg gtc ttt gac gcc aag gcc ccc tec ttc cgc cac gag gcc tac ggg 
Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 
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ggg tac aag gcg ggc egg gcc ccc acg ccg gag gac ttt ccc egg caa 2 88 
Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

etc gcc etc ate aag gag ctg gtg gac etc ctg ggg ctg gcg cgc etc 33 6 
Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

gag gtc ccg ggc tac gag gcg gac gac gtc ctg gcc age ctg gcc aag 3 84 
Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

aag gcg gaa aag gag ggc tac gag gtc cgc ate etc ace gcc gac aaa 432 
Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

gac ctt tac cag etc ctt tec gac cgc ate cac gtc etc cac ccc gag 480 
Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

ggg tac etc ate acc ccg gcc tgg ctt tgg gaa aag tac ggc ctg agg 52 8 
Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

ccc gac cag tgg gcc gac tac egg gcc ctg acc ggg gac gag tec gac 576 
Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

aac ctt ccc ggg gtc aag ggc ate ggg gag aag acg gcg agg aag ctt 624 
Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

ctg gag gag tgg ggg age ctg gaa gcc etc etc aag aac ctg gac egg 672 
Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

ctg aag ccc gcc ate egg gag aag ate ctg gcc cac atg gac gat ctg 72 0 
Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

aag etc tec tgg gac ctg gcc aag gtg cgc acc gac ctg ccc ctg gag 768 
Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

gtg gac ttc gcc aaa agg egg gag ccc gac egg gag agg ctt agg gcc 816 
Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

ttt ctg gag agg ctt gag ttt ggc age etc etc cac gag ttc ggc ctt 864 
Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 
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960 



1008 



1056 



ctg gaa age ccc aag gec ctg gag gag gec ccc tgg ccc ccg ccg gaa 912 
Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

ggg gec ttc gtg ggc ttt gtg ctt tec cgc aag gag ccc atg tgg gec 
Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

gat ctt ctg gec ctg gee gee gec agg ggg ggc egg gtc cac egg gec 
Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

ccc gag cct tat aaa gee etc agg gac ctg aag gag gcg egg ggg ctt 
Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

etc gec aaa gac ctg age gtt ctg gee ctg agg gaa ggc ctt ggc etc 1104 
Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

ccg ccc ggc gac gac ccc atg etc etc gec tac etc ctg gac cct tec 1152 
Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

aac acc acc ccc gag ggg gtg gec egg cgc tac ggc ggg gag tgg acg 12 0 0 
Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

gag gag gcg ggg gag egg gec gec ctt tec gag agg etc ttc gee aac 1248 
Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

ctg tgg ggg agg ctt gag ggg gag gag agg etc ctt tgg ctt tac egg 1296 
Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

gag gtg gag agg ccc ctt tec get gtc ctg gec cac atg gag gee acg 1344 
Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

999 gtg cgc ctg gac gtg gee tat etc agg gee ttg tec ctg gag gtg 13 92 
Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

gee gag gag ate gec cgc etc gag gee gag gtc ttc cgc ctg gec ggc 144 0 
Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

cac ccc ttc aac etc aac tec egg gac cag ctg gaa agg gtc etc ttt 
His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

gac gag eta ggg ctt ccc gee ate ggc aag acg gag aag acc ggc aag 1536 
Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

cgc tec acc age gec gec gtc ctg gag gee etc cgc gag gee cac ccc 1584 
Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

ate gtg gag aag ate ctg cag tac egg gag etc acc aag ctg aag age 1632 
He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 



1488 
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530 535 540 

acc tac att gac ccc ttg ccg gac etc ate cac ccc agg acg ggc cgc 
Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 



ggc gac gag aac ctg ate egg gtc ttc cag gag ggg egg gac ate cac 
Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 



gtg gag acc etc ttc ggc cgc cgc cgc tac gtg cca gac eta gag gec 
Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



ccc gtc cag ggc acc gee gee gac etc atg aag ctg get atg gtg aag 
Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 
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etc cac acc cgc ttc aac cag acg gee acg gee acg ggc agg eta agt 172 8 
Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

age tec gat ccc aac etc cag aac ate ccc gtc cgc acc ccg ctt ggg 1776 
Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

cag agg ate cgc egg gee ttc ate gee gag gag ggg tgg eta ttg gtg 1824 
Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

gec ctg gac tat age cag ata gag etc agg gtg ctg gee cac etc tec 1872 
Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 
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acg gag acc gec age tgg atg ttc ggc gtc ccc egg gag gec gtg gac 1968 
Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

ccc ctg atg cgc egg gcg gec aag acc ate aac ttc ggg gtc etc tac 2016 
Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

ggc atg teg gec cac cgc etc tec cag gag eta gee ate cct tac gag 2 064 
Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

gag gee cag gec ttc att gag cgc tac ttt cag age ttc ccc aag gtg 2112 
Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

egg gec tgg att gag aag acc ctg gag gag ggc agg agg egg ggg tac 2160 
Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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egg gtg aag age gtg egg gag gcg gee gag cgc atg gee ttc aac atg 2256 
Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 
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etc ttc ccc agg ctg gag gaa atg ggg gec agg atg etc ctt cag gtc 2352 
Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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cac aac gag ctg gtc etc gag gec cca aaa gag agg gcg gag gec gtg 24 00 
His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

gec egg ctg gec aag gag gtc atg gag ggg gtg tat ccc ctg gee gtg 244 8 
Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

ccc ctg gag gtg gag gtg ggg ata ggg gag gac tgg etc tec gee aag 2496 
Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



gag tgatag 
Glu 



<210> 56 
<211> 833 
<212> PRT 

<213> Thermus aquaticus 
<400> 56 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
7 85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 



2505 
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210 

Leu Lys Pro Ala lie 
225 

Lys Leu Ser Trp Asp 
245 



Val Asp Phe Ala Lys 
260 

Phe Leu Glu Arg Leu 
275 

Leu Glu Ser Pro Lys 
290 

Gly Ala Phe Val Gly 
305 

Asp Leu Leu Ala Leu 
325 

Pro Glu Pro Tyr Lys 
340 

Leu Ala Lys Asp Leu 
355 

Pro Pro Gly Asp Asp 
370 

Asn Thr Thr Pro Glu 
385 

Glu Glu Ala Gly Glu 
405 

Leu Trp Gly Arg Leu 
420 

Glu Val Glu Arg Pro 
435 

Gly Val Arg Leu Asp 
450 

Ala Glu Glu He Ala 
465 

His Pro Phe Asn Leu 
485 

Asp Glu Leu Gly Leu 
500 

Arg Ser Thr Ser Ala 
515 

He Val Glu Lys He 
530 



215 

Arg Glu Lys He Leu Ala 
230 235 

Leu Ala Lys Val Arg Thr 
250 

Arg Arg Glu Pro Asp Arg 
265 

Glu Phe Gly Ser Leu Leu 
280 

Ala Leu Glu Glu Ala Pro 
295 

Phe Val Leu Ser Arg Lys 
310 315 

Ala Ala Ala Arg Gly Gly 
330 

Ala Leu Arg Asp Leu Lys 
345 

Ser Val Leu Ala Leu Arg 
360 

Pro Met Leu Leu Ala Tyr 
375 

Gly Val Ala Arg Arg Tyr 
390 395 

Arg Ala Ala Leu Ser Glu 
410 

Glu Gly Glu Glu Arg Leu 
425 

Leu Ser Ala Val Leu Ala 
440 

Val Ala Tyr Leu Arg Ala 
455 

Arg Leu Glu Ala Glu Val 
470 475 

Asn Ser Arg Asp Gin Leu 
490 

Pro Ala He Gly Lys Thr 
505 

Ala Val Leu Glu Ala Leu 
520 

Leu Gin Tyr Arg Glu Leu 
535 



220 

His Met Asp Asp Leu 
240 

Asp Leu Pro Leu Glu 
255 

Glu Arg Leu Arg Ala 
270 

His Glu Phe Gly Leu 
285 

Trp Pro Pro Pro Glu 
300 

Glu Pro Met Trp Ala 
320 

Arg Val His Arg Ala 
335 

Glu Ala Arg Gly Leu 
350 

Glu Gly Leu Gly Leu 
365 

Leu Leu Asp Pro Ser 
380 

Gly Gly Glu Trp Thr 
400 

Arg Leu Phe Ala Asn 
415 

Leu Trp Leu Tyr Arg 
430 

His Met Glu Ala Thr 
445 

Leu Ser Leu Glu Val 
460 

Phe Arg Leu Ala Gly 
480 

Glu Arg Val Leu Phe 
495 

Glu Lys Thr Gly Lys 
510 

Arg Glu Ala His Pro 
525 

Thr Lys Leu Lys Ser 
540 
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Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arq Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu 



<210> 57 

<211> 26 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 57 

caggaggagc tcgttgtgga cctgga 



<210> 58 
<211> 836 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lvs Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 HO 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arq Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arq Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
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245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arq Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arq Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 
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Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 



Ser Ala Lys Gly 
835 



<210> 59 
<211> 2511 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 59 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
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ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcagggacg 


gccgggtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccctc 


caacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


catggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 
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gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 
gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtta g 2511 

<210> 60 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 60 

tctagaggat ctatcagtgg tggtggtggt ggtgctcctt ggcggagagc 

<210> 61 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 61 

tgcctgcagg tcgacgctag ctagtggtgg tggtggtggt gacccttggc ggaaagcc 

<210> 62 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 

<220> ^ , 

<223> Description of Artificial Sequence: Synthetic 

<400> 62 ^ 
atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 



50 



58 
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gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggg 


ggccgggtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


ccaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


acatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


c g a gg a ggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 
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ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 24 0 0 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 63 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 63 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 
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Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 



151 



Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 64 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: Synthetic 



<400> 64 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcagggacg 


gccgggtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccctc 


caacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


catggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaqgagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 
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tccgacccca 


accugcagaa 


catccccgt c 


cgcaccccc l 


tgggecagag gatccgccgg 


l ft n n 


gccttcgtgg 


ccgaggcggg 


L.uyyyoy tug 


gtggcccugg 


actatageca 


gatagagctc 


1860 


cgcgtcct eg 


z^i /i /-i — * ^ /*1 4— /"■* 

CCCaCCCCUC 


eggggacgaa 


■a ^ ft /"i 4— /-«r ^ 4— *-» ^ 


gggtcttcca 


ggaggggaag 


1920 


ga.ca.tcca.c3. 


cccagaccgc 


aagccggacg 




ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggegg c c aa 


gaeggtgaac 


4— 4— j^j jt/t /-i /— t 4— j-i /i 

txcggcgt cc 


tetaeggcat 


gtccgcccat 


9 D4n 

Z< U *± VJ 


aggctctccc 


aggagc 1 1 gc 


catcccctac 


gaggaggegg 


tggectttat 


agagegctae 


91 no 


ttccaaagct 


tccccaaggt 


gegggee tgg 


auagaaaag a 


ccctggagga 


ggggaggaag 


9 1 6 n 


eggggctacg 


tggaaaccct 


cr. teggaaga 


aggegcuacg 


tgcccgacct 


caacgcccgg 


9 9 9 n 
z. z. Zi u 


gtgaagagcg 


t cagggaggc 


cgcggagcgc 


--\ 4— /— i /i 4— 4— -3 

atygccttcd 


acatgcccgt 


ccagggcacc 


9 9 R n 
z. z o u 


gccgccgacc 


tcacgaagcu 


cgccauggug 




cccgcctccg 


ggagatgggg 


Zi J *± u 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


eggctttgge 


caaggaggee 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 65 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 65 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 
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Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 
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Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 
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Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 66 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 66 

gccgccaggg gcggccgcgt ccaccgggcc 3 0 



<210> 67 

<211> 31 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 67 

gcctgcaggg gcggccgcgt gcaccggggc a 31 



<210> 68 

<211> 26 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 68 

ctcctggacc cttcgaacac cacccc 2 6 



<210> 69 

<211> 23 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 69 

gtcctggccc atatggaggc cac 23 



<210> 70 

<211> 2526 

<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: Synthetic 



<400> 70 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


□ ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


y.g ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


. : j gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


O gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


^ atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


fO gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


X gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


O gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 
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■f— /~t s~i f~r *-% f-\ ft j*i ^ 


accuy cayaa 


catccccgt c 


cgcaccccct 


tgggecagag 


gatccgccgg 


1800 




ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatageca 


gatagagctc 


1860 


cycyuccLCy 


cccacct etc 


eggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


yclUcAUCec±l_.a. 


cccay acege 


aagctggatg 


ttcggcgtcc 


c c c eggagg c 


cgtggacccc 


1980 


+- (TO ■!— Cf/-i <T /-i /-i 

uLyaLyLyLO 


yyyeyy ccaa 


gaeggtgaac 


ttcggcgtcc 


tetaeggcat 


gtccgcccat 


204 0 


frcTf 1 f~ t" 1 1* f 1 z" 1 r~* 


c*y y a.y(_c tyc 


/~» ^ \~ r* i~~t /~i /~i ^ ^ 

CdtCCCCLdC 


gaggaggegg 


tggectttat 


agagegctae 


210 0 


L, L. k_ ctdcty o l_ 


cot.ee aay gc 


gegggecugg 


atagaaaaga 


ccctggagga 


9999 a 99 aa 9 


2160 


r'ncinrrr'l" a /-i/-t 

t -3yyy CLdC y 


"H Tf~f T\ /—I /—I 4— 

tyydadCCCt 


c ll eggaaga 


aggegctacg 


tgcccgacct 


caacgcccgg 


2220 


y tyddydycy 


ucagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


("T^" , f" , f"Tf*' , f~'0 , afT'* 

yu^yLuyaLL 


4- /-1 -3 -f- /"fa a f~if* \- 

utatyddyct 


cgccauggcg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


eggctttgge 


caaggaggee 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 71 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 71 
atgaattegg 


ggatgetgee 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggee 


tcaccaccag 


ceggggggag 


120 


cc 99tgcagg 


eggtctaegg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


egaggeggae 


360 


gaegtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


eggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


eggegaggaa 


gcttctggag 


gagtggggga 


gectggaage 


cctcctcaag 


660 


aacctggacc 


ggctgaagee 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 
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dciy V_ L. ^ L. ^ t_ 


gggaccuggc 


caaggcgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 




cty i^^y ctov_y 


ft/~r a fT*a /~r/~v /~t 4- 4- 

gSayayyctt 


agggecttte 


t99 a 9 a 99 ct 


tgagtttggc 


840 




acyayutcyy 


/-i i^t 4— 4— f~\ 4- /-rt^r" \ 

ccuucc ggaa 


age c c c aagg 


ccctggagga 


ggccccctgg 


900 


f* p p r* c cs c c* an 


ctctyyy y u u 




/T 4— f-*f +* 4— -1— y-t « 

gcgcu euccc 


geaaggagee 


catgtgggcc 


960 


y 1>^^L>^/ ^ zd 




^yLLd.yyyyc 


ggccgcgt cc 


accgggcccc 


egagecttat 


1020 




yyycicctgaa 


ggaggegegg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


geccugaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


+~ rtfi -3 /~i /~i 4— 4— 

CLyydCCCtt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


getaeggegg 


ggagtggacg 


1200 


rra t~~f n ftrrrtfT/-r 

gaggaggegg 


999 a 9 c gggc 


cgccctttcc 


ga-gaggctct 


tcgccaacct 


gtgggggagg 


1260 


t-ttyaggggg 


a rrrf a fra rtrir* i~ 

0-gga.gaggc t 


/-i >-i +— 4— ^ /^ri^r 4* +*■ 

CCttuyyCCt 


tacegggagg 


tggagaggee 


cctttccgct 


1320 




a j- at* rrrrarrrfP 
di_.ci.L-yy cty y 


cac 9yy9y >-g 


cgcctggacg 


tggectatet 


ca 999 cc ttg 


1380 


lu^L L.y y ct^M 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


a /"■» /~i 4— 4— /—i --i 


acc ucaaccc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


teggcaagae 


ggagaagacc 


ggcaageget 


ccaccagcgc 


cgccgtcctg 


1560 


gayy c cctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


cicty v_ uy day d 


yCaCCtaCaU 


tgacccct tg 


ccggacctca 


tccaccccag 


g a cgggccgc 


1680 




y C U UCaaCC a 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 




a ^ a +~ /~< ft rft - 
atatoCCCyc 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gc cgaggagg 


gg^ggcuaut: 


ggtggccctg 


gaetatagee 


agatagagct 


cagggtgctg 


1860 


f~f /-I f"1 /"< a y-1 4— /-I 4— 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


cicggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


y y y ^ y y »■ 


a /~* <~i a +~ /~i ^ 
dyaOCaLCaa 


etteggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


a f*rs~r a i^T /-i 4— — i /— r 

Layydyctay 


ccaucccc ca 


egaggaggee 


caggccttca 


ttgagegcta 


ctttcagagc 


2100 


l. u. w ^ o ct cty y 


tgcgggccug 


gattgagaag 


accctggagg 


a 999 ca 99 a 9 


gegggggtae 


2160 


y i-yy«yaccc 


tCLtcyyccy 


ccgccgctac 


gtgccagacc 


tagaggeccg 


ggtgaagagc 


2220 


it4— rrr ifrrtfr ^ ft ft 

g-cgegggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


ta ggggagga 


ctggctctcc 


gecaaggage 


accaccacca 


ccaccac 


2517 



<210> 72 



160 



<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 72 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 
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His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 73 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 73 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
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Val Val Phe Asp Ala 
65 

Gly Tyr Lys Ala Gly 
85 

Leu Ala Leu lie Lys 
100 

Glu Val Pro Gly Tyr 
115 

Lys Ala Glu Lys Glu 
130 

Asp Leu Tyr Gin Leu 
145 

Gly Tyr Leu lie Thr 
165 

Pro Asp Gin Trp Ala 
180 

Asn Leu Pro Gly Val 
195 

Leu Glu Glu Trp Gly 
210 

Leu Lys Pro Ala lie 
225 

Lys Leu Ser Trp Asp 
245 

Val Asp Phe Ala Lys 
260 

Phe Leu Glu Arg Leu 
275 

Leu Glu Ser Pro Lys 
290 

Gly Ala Phe Val Gly 
305 

Asp Leu Leu Ala Leu 
325 

Pro Glu Pro Tyr Lys 
340 

Leu Ala Lys Asp Leu 
355 

Pro Pro Gly Asp Asp 
370 



55 

Lys Ala Pro Ser Phe Arg 
70 75 

Arg Ala Pro Thr Pro Glu 
90 

Glu Leu Val Asp Leu Leu 
105 

Glu Ala Asp Asp Val Leu 
120 

Gly Tyr Glu Val Arg lie 
135 



Leu Ser Asp Arg lie His 
150 155 

Pro Ala Trp Leu Trp Glu 
170 

Asp Tyr Arg Ala Leu Thr 
185 

Lys Gly lie Gly Glu Lys 
200 

Ser Leu Glu Ala Leu Leu 
215 

Arg Glu Lys lie Leu Ala 
230 235 

Leu Ala Lys Val Arg Thr 
250 

Arg Arg Glu Pro Asp Arg 
265 

Glu Phe Gly Ser Leu Leu 
280 

Ala Leu Glu Glu Ala Pro 
295 

Phe Val Leu Ser Arg Lys 
310 315 

Ala Ala Ala Arg Gly Gly 
330 

Ala Leu Arg Asp Leu Lys 
345 

Ser Val Leu Ala Leu Arg 
360 

Pro Met Leu Leu Ala Tyr 
375 



60 

His Glu Ala Tyr Gly 
80 

Asp Phe Pro Arg Gin 
95 

Gly Leu Ala Arg Leu 
110 

Ala Ser Leu Ala Lys 
125 

Leu Thr Ala Asp Lys 
140 

Val Leu His Pro Glu 
160 

Lys Tyr Gly Leu Arg 
175 

Gly Asp Glu Ser Asp 
190 

Thr Ala Arg Lys Leu 
205 

Lys Asn Leu Asp Arg 
220 

His Met Asp Asp Leu 
240 

Asp Leu Pro Leu Glu 
255 

Glu Arg Leu Arg Ala 
270 

His Glu Phe Gly Leu 
285 

Trp Pro Pro Pro Glu 
300 

Glu Pro Met Trp Ala 
320 

Arg Val His Arg Ala 
335 

Glu Ala Arg Gly Leu 
350 

Glu Gly Leu Gly Leu 
365 

Leu Leu Asp Pro Ser 
380 
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Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 74 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 74 



atgaattcgg ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 
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agcctcctcc acgagttcgg ccttctggaa 
cccccgccgg aaggggcctt cgtgggcttt 
gatcttctgg ccctggccgc cgccaggggc 
9cggggctaa aggacctcaa ggaggtccgg 
gcctcgaggg aggggctaga cctcgtgccc 
ctggaccctt cgaacaccac ccccgagggg 
gaggacgccg cccaccgggc cctcctctcg 
ctcgaggggg aggagaagct cctttggctc 
gtcctggccc atatggaggc caccggggta 
tccctggagc ttgcggagga gatccgccgc 
caccccttca acctcaactc ccgggaccag 
cttcccgcct tggggaagac gcaaaagaca 
gaggccctac gggaggccca ccccatcgtg 
aagctcaaga acacctacgt ggaccccctc 
ctccacaccc gcttcaacca gacggccacg 
aacctgcaga acatccccgt ccgcaccccc 
gccgaggcgg gttgggcgtt ggtggccctg 
gcccacctct ccggggacga aaacctgatc 
acccagaccg caagctggat gttcggcgtc 
cgggcggcca agacggtgaa cttcggcgtc 
caggagcttg ccatccccta cgaggaggcg 
ttccccaagg tgcgggcctg gatagaaaag 
gtggaaaccc tcttcggaag aaggcgctac 
gtcagggagg ccgcggagcg catggccttc 
ctcatgaagc tcgccatggt gaagctcttc 
ctcctccagg tccacaacga gctcctcctg 
gcggctttgg ccaaggaggc catggagaag 
gaggtgggga tgggggagga ctggctttcc 

<210> 75 

<211> 839 

<212> PRT 

<213> Artificial Sequence 



agccccaagg ccctggagga ggccccctgg 900 
gtgctttccc gcaaggagcc catgtgggcc 960 
ggccgcgtgc accgggcagc agaccccttg 102 0 
ggcctcctcg ccaaggacct cgccgtcttg 1080 
ggggacgacc ccatgctcct cgcctacctc 114 0 
gtggcgcggc gctacggggg ggagtggacg 12 00 
gagaggctcc atcggaacct ccttaagcgc 12 60 
taccacgagg tggaaaagcc cctctcccgg 132 0 
cggcgggacg tggcctacct tcaggccctt 13 80 
ctcgaggagg aggtcttccg cttggcgggc 1440 
ctggaaaggg tgctctttga cgagcttagg 1500 
ggcaagcgct ccaccagcgc cgcggtgctg 1560 
gagaagatcc tccagcaccg ggagctcacc 162 0 
ccaagcctcg tccacccgag gacgggccgc 1680 
gccacgggga ggcttagtag ctccgacccc 1740 
ttgggccaga ggatccgccg ggccttcgtg 1800 
gactatagcc agatagagct ccgcgtcctc 1860 
agggtcttcc aggaggggaa ggacatccac 1920 
cccccggagg ccgtggaccc cctgatgcgc 198 0 
ctctacggca tgtccgccca taggctctcc 2040 
gtggccttta tagagcgcta cttccaaagc 2100 
accctggagg aggggaggaa gcggggctac 2160 
gtgcccgacc tcaacgcccg ggtgaagagc 2220 
aacatgcccg tccagggcac cgccgccgac 2280 
ccccgcctcc gggagatggg ggcccgcatg 2340 
gaggcccccc aagcgcgggc cgaggaggtg 2400 
gcctatcccc tcgccgtgcc cctggaggtg 2460 
gccaagggtc accaccacca ccaccac 2517 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 75 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
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305 



310 



315 



320 



Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 
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Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 



<210> 76 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 76 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

170 



a t* p p i~ p a c per 

W4 I—- *w v^, V^. ^wj 


ppna c r*nrrt^i 


v_.v_.U-V_ l_clv_v_clcl 


c_ icy tctc eg 


accgcgtcgc 


cgtcctccac 


480 


pp p cr 3 00 p/ p p 


appfpahpar 

O. ^ L»- UV_.Ctt_.l_.dv_ 


/-1 r*r*rrrt ^rrt~ rrrr 

cceggagugg 


c u u egggaga 


agtaeggect 


caggceggag 


540 


<-c*y t-yyy L-yy 


apf-i-rt c*rrr , rrr* 
gl v_ l. l. v_ v_y oy l 


f~t /—\ 4— /-t i^t4 — rrrrrrrt 

v_v_ i_v_y uy y y y 


gacccctccg 


acaacct ccc 


eggggt caag 


600 


yyt-dt_.v_yyyy 


a rfa anan r*rrr* 

cty cicty cLoc*y l. 


r* r* 4~~ /^i *^ "^1 /^c/*! 4 — « 

CCtCaayCEC 


ctcaaggagt 


9999 aa 9 cct 


ggaaaacctc 


660 


v_ L.^aayaauu 


1~ prrra r*rrcrrr\~ 

^yyciouyyy l 


n nr* r* T» /^r ^ 

aadyCCayaa 


aacguceggg 


agaaga t caa 


ggcccacctg 


720 


y CIClVj l_ V_ CI 


yyut^LULUL 


nrra <^t /^» 4— /~> +■ r*ri 

yyayotctcc 


/™1 i^f /^f /"T /^T — * 

^gy y ^ y ^ 9^ ^> 


ccgacctccc 


cctggaggtg 


780 


aacrt rarrr 1 
t* v_ \w i_ ^~*y >w *w ^ 


;q rrrrrrrrr*rrrrrrzi 

^yyyy^yyya- 


rrr 1 ^ prra r* r 1 rrrr 

yLLLydu-tyy 


/^T rrrrrmr* 4— 4— — * 

9 a 9999 cl::l - a 


gggccttcct 


ggagaggctg 


840 


93-9" 1 1 c gg c a. 


crept" ppt* ppa 


^ydy l, Luyyt 


/-t 4- r-i 4— nrra rrrr 

o u c_ t_ u ggagg 


cccccycccc 


cctggaggag 


Q A Pi 

you 


ctp p p p p 1~ crcrp 


p pp r 1 rr pp nrr a 
uuu^y ^uy yet 


a rrrrrrrrr' r>\~ ^ r* 

dyy y y v_,v_ u i_ v_ 


gtgggcctcg 


tcctzctcccg 


ccccgagccc 


960 


a tot" cracrpcrcr 
ai_y >-yyy v_yy 


CLy V_ t_ L. CLCLCLy V_ 


PP^nprpnnr'P 

v_v_ uy y v_v_y v-v_ 


4- r~r/~t -a rrrrrtrmrt 

L-gcaggggcg 


gccgcgtcca 


ccgggccccc 


102 0 


y ciy k_ v_ i_ L. d L. ct 


ddy v_t_v_ucdg 


ggacctgaag 


gaggegeggg 


ggcttctcgc 


caaagacctg 


1080 


c^y <-y LULLyy 


v_v_v_ i_y ciyyy d 


aggece uggc 


ctcccgcccg 


gcgacgaccc 


catgctcctc 


1140 


y^v L-0.V_.V_* l_. V_ V_ 


i_.yydv_v_v_i_ ll. 


y ddCaCCdCC 


cccgaggggg 


tggcccggcg 


etaeggeggg 


1200 


y cty i-yyci(-yy 


a <™T(Ta nrtr'nrrrr 

^yy^ygeggg 


ggagegggee 


gccctttccg 


agaggctctt 


cgccaacctg 


1260 


^yyyyy^yy ^ 


i_ uy ciyyyyy ci 


rfr~r ^ p;a nrrri 4- ^ 


^i 4- -f- 4- /-T^r y~i 4- 4- 4- 

curuygcttt 


acegggaggt 


ggagaggece 


1320 


pi~ "h t" r 1 pctt 1 1~ n 


j-ppt - ctrir* r*r , s 
tyyLLLct 


uduggciggcc 


ac 9ggggtgc 


gcctggacgt 


ggcctatctc 


1380 




/"-i /—i 4— i~if~r ■— 1 4— 

t_- uc. L.yy ciggt. 


ggecgaggag 


atcgcccgcc 


tegaggcega 


ggtcttccgc 


1440 


yy^'^yy^^' 




/~1 4— /*f * « +- « 

ccucaacucc 


cgggaccagc 


tggaaagggt 


cctctttgac 


1500 


y ^y ^- <- «.y y y *— 


4- 4- nnpnppah 
l. m.v_v_yu.L.ci.U 


r* rrrtr* a -r> /^r -a 

t-ygcdcigdcg 


gagaagaccg 


gcaagcgctc 


caccagcgcc 


1560 


Qcccrtcctacr 


cLy yv_.v_.v_, v_-v_v_>y 


v-.ydyyv_v_v_.dt_. 


/-I /—I fl -»i 4— yr-, ^-w 4— i^r/-~r 

LLLdtcyuyy 


agaaga c ecu 


geagtacegg 


t r o a 

162 0 


CiaCfPt" P3 PP3 


ciy v_ uycLctyciy 


v_ d V_ V_ t. a. C d t. L. 


gacccc uege 


cggacctcat 


ccaccccagg 


1680 


apcfcrcrppcrpp 


hppapannprr 
L.v_v_civ_cLv_v_v_.y 


t_ l_ LOaaCCdy 


acggccacgg 


ccacgggcag 


gctaagtagc 


1740 


L.v_v_ydL.v_t_v_d 


acc uccagaa 


catccccgtc 


cgcaccccgc 


ttgggcagag 


gatccgccgg 


1800 


elf* 4- 4- r*< a 4~ r*rr 

ytLt u v_d l. t_y 


ccgaggaggg 


gtggctattg 


gtggccctgg 


actatageca 


gatagagctc 


1860 


fTrrrrt" rrr* t~ rrrr 

a-ggg^gexgg 


cccaccccuc 


eggegacgag 


aacctgatcc 


gggtcttcca 


ggaggggegg 


1920 


ya.v_ciL.v_v_civ_d 


v- y y ciy d c eg c 


cagcrggatg 


ttcggcgtcc 


cccgggaggc 


cgtggacccc 


1980 


v_v_yci.v_<yv_.yv_v_ 


rrrrri r^rtrt r* r> a a 

yyy eggecaa 


gaccaccaac 


tteggggtec 


tetaeggcat 


gtcggcccac 


2040 


cgcctctccc 


aggagctagc 


catcccttac 


gaggaggece 


aggecttcat 


tgagegctae 


2100 


tttcagagct 


tccccaaggt 


gcgggcctgg 


attgagaaga 


ccctggagga 


gggcaggagg 


2160 


egggggtacg 


tggagaccct 


cttcggccgc 


cgccgctacg 


tgccagacct 


a 9 a 99 ccc 99 


2220 


gtgaagagcg 


tgegggagge 


ggccgagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 
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gccgccgacc tcatgaagct ggctatggtg aagctcttcc ccaggctgga ggaaatgggg 234 0 

gccaggatgc tccttcaggt ccacaacgag ctggtcctcg aggccccaaa agagagggcg 2400 

gaggccgtgg cccggctggc caaggaggtc atggaggggg tgtatcccct ggccgtgccc 2460 

ctggaggtgg aggtggggat aggggaggac tggctctccg ccaaggagca ccaccaccac 252 0 

caccac 2526 

<210> 77 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 77 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 



172 



Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 
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Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp 
595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 



<210> 78 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
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<400> 78 
3. t gaat t egg 


ggatgetgee 


cctctttgag 


caccacctgg 


cctaccgcac 


cttccacgcc 


ccggtgcagg 


eggtctaegg 


cttcgccaag 


gacgcggtga 


ucgcggcc tt 


tgacgccaag 


gggtacaagg 


cgggccgggc 


ccccacgccg 


aaggagctgg 


tggacctcct 


ggggctggcg 


gaegtcctgg 


ccagcctggc 


caagaaggcg 


accgccgaca 


aagaccttta 


ccagctcctt 


ggg t ac c t c a 


tcaccccggc 


ctggctttgg 


gccgacua.ee 


gggccctgac 


eggggacgag 


9999 a 9 aa 9 a 


eggegaggaa 


gcttctggag 


aacctggacc 


ggctgaagee 


cgccatccgg 


aagctctcct 


gggacctggc 


caaggtgege 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


age c tec tec 


aegagttegg 


ccttctggaa 


cccccgccgg 


aaggggcett 


cgtgggcttt 


gat c ttctgg 


ccctggccgc 


cgccaggggc 


gcggggctaa 


aggacctcaa 


ggaggtcegg 


gectcgaggg 


aggggctaga 


cctcgtgccc 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gaggacgecg 


cccaccgggc 


cctcctctcg 


ctcgaggggg 


aggagaagct 


cctttggctc 


g t c c t ggc c c 


atatggaggc 


caceggggta 


tccctggagc 


ttgcggagga 


gatccgccgc 


caccccttca 


acctcaactc 


ccgggaccag 


cttcccgcct 


tggggaagac 


gcaaaagaca 


gaggccctac 


999 a ggccca 


ccccatcgtg 


aagctcaaga 


acacctacgt 


ggaccccctc 


ctccacaccc 


gcttcaacca 


gacggccacg 


aacctgeaga 


acatccccgt 


ccgcaccccc 


gecgaggagg 


ggtggctatt 


ggtggccctg 



cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


ctgaagggee 


tcaccaccag 


ceggggggag 


120 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


cgcctcgagg 


tcccgggcta 


egaggeggae 


360 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


9 a 9tggggga 


gectggaage 


cctcctcaag 


660 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


agggecttte 


tggagaggct 


tgagtttggc 


840 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


gtgctttccc 


geaaggagee 


catgtgggcc 


960 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


gtggcgcggc 


getaeggggg 


ggagtggacg 


1200 


gagaggctcc 


ateggaaect 


ccttaagcgc 


1260 


taccacgagg 


tggaaaagee 


cctctcccgg 


1320 


eggegggacg 


tggcctacct 


tcaggccctt 


1380 


ctcgaggagg 


a ggtcttccg 


ettggeggge 


1440 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


ggcaageget 


ccaccagcgc 


cgcggtgctg 


1560 


gagaagatcc 


tccagcaccg 


acr acre t* p a p p 


1620 


ccaagcctcg 


tccacccgag 


9 a C999ccgc 


1680 


gecaegggga 


ggc t tag tag 


ctccgacccc 


1740 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gaetatagee 


agatagagct 


cagggtgctg 


1860 



175 



gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 24 0 0 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 79 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 79 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 
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Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 
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Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 
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Glu His His His His His His 
835 



<210> 80 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 80 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cgggg a cgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


a ccgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


gg a gtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


a ggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


a ggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 
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cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tccacaacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggtc 


accaccacca 


ccaccac 


2517 



<210> 81 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 81 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 . 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 
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Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 
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Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 
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Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 82 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 82 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 
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ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


"1 O A A 

lz 0 0 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1 O A A 

1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 83 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 83 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 
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Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 
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Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 
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Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 84 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 84 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 
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aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggecttte 


tggagaggct 


ugaguTzcggc 


OA Pi 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


cccuggagga 


ggccccccgg 


qnn 


cccccgccgg 


aaggggc c 1 1 


eg t gggc 1 1 1 


gtgcttt ccc 


geaaggagee 




yO\J 


gatcttctgg 


ccctggccgc 


cgccaggggc 


9£fccgcgtcc 


accgggcccc 


egagect tat 


± u z u 


aaagccctca 


gggacctgaa 


ggaggegegg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


T A Q a 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


i i /in 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


getaeggegg 


ggagtggacg 


*i o a a 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


Izbu 


cttgaggggg 


aggagaggct 


cctttggctt 


tacegggagg 


tggagaggee 


cctttccgct 




gtcctggccc 


atatggaggc 


caceggggta 


eggegggacg 


tggcctacct 


tcaggccctt 


*1 *3 Q A 

1 j oU 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


ettggeggge 


1 A A A 

144U 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


"1 C A A 

lb UU 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaageget 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaaga t c c 


tccagcaccg 


ggagctcacc 


icon 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


IboU 


ctccacaccc 


gcttcaacca 


gacggccacg 


gecaegggga 


ggcttagtag 


ctccgacccc 


1 Ty| n 

1 /4U 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gaetatagee 


agatagagct 




1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 




acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


T Q Q A 


cgggcggcca 


agaccatcaa 


etteggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


O A /I A 


caggagctag 


ccatccctta 


egaggaggee 


caggccttca 


ttgagegcta 


ctttcagagc 


o n a a 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcggggg t ac 




gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggeccg 


ggtgaagagc 


A A O A 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


A A O A 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gecaaggage 


accaccacca 


ccaccac 


2517 



<210> 85 
<211> 839 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 85 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 
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Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 
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Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 86 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 86 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 30 0 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
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accgccgaca aagaccttta ccagctcctt 
gggtacctca tcaccccggc ctggctttgg 
gccgactacc gggccctgac cggggacgag 
ggggagaaga cggcgaggaa gcttctggag 
aacctggacc ggctgaagcc cgccatccgg 
aagctctcct gggacctggc caaggtgcgc 
aaaaggcggg agcccgaccg ggagaggctt 
agcctcctcc acgagttcgg ccttctggaa 
cccccgccgg aaggggcctt cgtgggcttt 
gatcttctgg ccctggccgc cgccaggggc 
aaagccctca gggacctgaa ggaggcgcgg 
gccctgaggg aaggccttgg cctcccgccc 
ctggaccctt cgaacaccac ccccgagggg 
gaggacgccg cccaccgggc cctcctctcg 
ctcgaggggg aggagaagct cctttggctc 
gtcctggccc atatggaggc caccggggta 
tccctggagc ttgcggagga gatccgccgc 
caccccttca acctcaactc ccgggaccag 
cttcccgcct tggggaagac gcaaaagaca 
gaggccctac gggaggccca ccccatcgtg 
aagctcaaga acacctacgt ggaccccctc 
ctccacaccc gcttcaacca gacggccacg 
aacctgcaga acatccccgt ccgcaccccc 
gccgaggagg ggtggctatt ggtggccctg 
gcccacctct ccggcgacga gaacctgatc 
acggagaccg ccagctggat gttcggcgtc 
cgggcggcca agaccatcaa cttcggggtc 
caggagctag ccatccctta cgaggaggcc 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccgccgctac 
gtgcgggagg cggccgagcg catggccttc 



tccgaccgca 


tccacgtcct 


ccaccccgag 


a q n 


gaaaagtacg 


gcctgaggcc 


cgaccagcgg 


RAH 


tccgacaacc 


ttcccggggt 


caagggca u c 


c n n 

gUU 


gagtggggga gcctggaagc 


cctcctcaag 


b o U 


gagaagatcc 


tggcccacat 


ggacgatctg 


Tin 


accqacctgc 

J ZJ 


ccctggaggt 


ggacttcgcc 


780 


agggcctttc 


tqqaqaqqct 


tgagtttggc 


840 


agccccaagg 


ccctggagga 


ggccccctgg 


o a a 
y U U 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


y b u 


ggccgcgtcc 


accgggcccc 


cgagcct t at 


luzu 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1 A Q A 

lUoU 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


i 1 a a 
114 U 


gtggcgcggc 

ZJ ZJ ZJ ZJ ZJ ZJ 


gctacggggg 


ggagtggacg 


1 O A A 

Iz U U 


qaqaqgctcc 

z) zj —jzj 


atcggaacct 


ccttaagcgc 


izbU 


taccacgagg 


tggaaaagcc 


cctctcccgg 


i Tin 


cggcgggacg 


tggcctacct 


tcaggccctt 


IjoU 


Ctcqaqqaqq 

v »- v z) 1 zjzj 1 zjzj 


aqqtcttccq 


cttggcgggc 


144U 


ctqqaaaqqq 

J J ZJ ZJ ZJ 


tqctctttga 


cgagcttagg 


1 C A A 

IbUU 


qqcaaqcqct 


ccaccagcgc 


cgcggtgctg 


1 [T f A 

Ibb U 


gagaagatcc 


tccagcaccg 


ggagctcacc 


i con 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1 O A 


gccacgggga 


qqcttaqtaq 


ctccgacccc 


1 /4U 


ttgggccaga 


QQatCCQCCQ 


ggccttcatc 


1 O A A 


gactatagcc 


agatagagct 


cagggtgctg 




cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2 040 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


accctggagg 


sgggcaggag 


gcgggggtac 


2160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 



192 



ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2 340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 
g&ggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 87 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 87 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 
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Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 
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Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arq Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arq Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 88 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 88 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
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caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cc tggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 
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acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2 517 

<210> 89 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 89 tt _ T 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Glv Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 



Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 
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Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 
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<210> 90 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 90 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 12 0 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 
atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 102 0 
gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 10 80 
gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 
gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 
gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 
cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 
ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 
caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 
ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 150 0 
gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 
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gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 
gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 
acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 
tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 
gccttcatcg ccgaggaggg gtggctattg gtggccctgg actatagcca gatagagctc 1860 
agggtgctgg cccacctctc cggcgacgag aacctgatcc gggtcttcca ggaggggcgg 1920 
gacatccaca cggagaccgc cagctggatg ttcggcgtcc cccgggaggc cgtggacccc 1980 
ctgatgcgcc gggcggccaa gaccatcaac ttcggggtcc tctacggcat gtcggcccac 2040 
cgcctctccc aggagctagc catcccttac gaggaggccc aggccttcat tgagcgctac 2100 
tttcagagct tccccaaggt gcgggcctgg attgagaaga ccctggagga gggcaggagg 2160 
cgggggtacg tggagaccct cttcggccgc cgccgctacg tgccagacct agaggcccgg 2220 
gtgaagagcg tgcgggaggc ggccgagcgc atggccttca acatgcccgt ccagggcacc 22 80 
gccgccgacc tcatgaagct ggctatggtg aagctcttcc ccaggctgga ggaaatgggg 2340 
gccaggatgc tccttcaggt ccacaacgag ctggtcctcg aggccccaaa agagagggcg 2400 
gaggccgtgg cccggctggc caaggaggtc atggaggggg tgtatcccct ggccgtgccc 2460 
ctggaggtgg aggtggggat aggggaggac tggctctccg ccaaggagca ccaccaccac 2520 

2526 



caccac 



<210> 91 
<211> 842 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 91 — - -i 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 
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Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 
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Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp 
595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp lie His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 
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Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 

<210> 92 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 92 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


g&gggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtcca 


ccgggccccc 


1020 


gagccttata 


aagccctcag 


ggacctgaag 


gaggcgcggg 


ggcttctcgc 


caaagacctg 


1080 


agcgttctgg 


ccctgaggga 


aggccttggc 


ctcccgcccg 


gcgacgaccc 


catgctcctc 


1140 
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gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcccggcg 


ctacggcggg 


1200 


gagtggacgg 


aggaggcggg 


ggagcgggcc 


gccctttccg 


agaggctctt 


cgccaacctg 


1260 


tgggggaggc 


ttgaggggga 


ggagaggctc 


ctttggcttt 


accgggaggt 


ggagaggccc 


1320 


ctttccgctg 


tcctggccca 


tatggaggcc 


acgggggtgc 


gcctggacgt 


ggcctatctc 


1380 


agggccttgt 


ccctggaggt 


ggccgaggag 


atcgcccgcc 


tcgaggccga 


ggtcttccgc 


1440 


ctggccggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


cctctttgac 


1500 


gagctagggc 


ttcccgccat 


cggcaagacg 


gagaagaccg 


gcaagcgctc 


caccagcgcc 


1560 


gccgtcctgg 


aggccctccg 


cgaggcccac 


cccatcgtgg 


agaagatcct 


gcagtaccgg 


1620 


gagctcacca 


agctgaagag 


cacctacatt 


gaccccttgc 


cggacctcat 


ccaccccagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacgggcag 


gctaagtagc 


1740 


tccgatccca 


acctccagaa 


catccccgtc 


cgcaccccgc 


ttgggcagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


cca.cca.ccaa 


2520 


caccac 












2526 



<210> 93 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 93 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 



205 



20 



25 



30 



Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 



His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 



206 



Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 
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Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 94 
<211> 2499 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 94 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


c 99 c 9 a 99 aa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 
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aacctggacc 


ggctgaagcc 


cgccatccgg 


aagctctcct 


gggacctggc 


caaggtgcgc 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agcctcctcc 


acgagttcgg 


ccttctggaa 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gatcttctgg 


ccctggccgc 


cgccaggggc 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gaggacgccg 


cccaccgggc 


cctcctctcg 


ctcgaggggg 


aggagaagct 


cctttggctc 


gtcctggccc 


atatggaggc 


caccggggta 


tccctggagc 


ttgcggagga 


gatccgccgc 


caccccttca 


acctcaactc 


ccgggaccag 


cttcccgcct 


tggggaagac 


gcaaaagaca 


gaggccctac 


gggaggccca 


ccccatcgtg 


aagctcaaga 


acacctacgt 


ggaccccctc 


ctccacaccc 


gcttcaacca 


gacggccacg 


aacctgcaga 


acatccccgt 


ccgcaccccc 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gcccacctct 


ccggggacga 


aaacctgatc 


acccagaccg 


caagctggat 


gttcggcgtc 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


caggagcttg 


ccatccccta 


cgaggaggcg 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtcagggagg 


ccgcggagcg 


catggccttc 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ctcctccagg 


tccacgacga 


gctcctcctg 


gcggctttgg 


ccaaggaggc 


catggagaag 


gaggtgggga 


tgggggagga 


ctggctttcc 



gagaagatcc tggcccacat ggacgatctg 720 
accgacctgc ccctggaggt ggacttcgcc 780 
agggcctttc tggagaggct tgagtttggc 840 
agccccaagg ccctggagga ggccccctgg 90 0 
gtgctttccc gcaaggagcc catgtgggcc 960 
ggccgcgtgc accgggcagc agaccccttg 1020 
ggcctcctcg ccaaggacct cgccgtcttg 1080 
ggggacgacc ccatgctcct cgcctacctc 1140 
gtggcgcggc gctacggggg ggagtggacg 1200 
gagaggctcc atcggaacct ccttaagcgc 1260 
taccacgagg tggaaaagcc cctctcccgg 1320 
cggcgggacg tggcctacct tcaggccctt 1380 
ctcgaggagg aggtcttccg cttggcgggc 1440 
ctggaaaggg tgctctttga cgagcttagg 150 0 
ggcaagcgct ccaccagcgc cgcggtgctg 1560 
gagaagatcc tccagcaccg ggagctcacc 1620 
ccaagcctcg tccacccgag gacgggccgc 1680 
gccacgggga ggcttagtag ctccgacccc 1740 
ttgggccaga ggatccgccg ggccttcgtg 1800 
gactatagcc agatagagct ccgcgtcctc 1860 
agggtcttcc aggaggggaa ggacatccac 1920 
cccccggagg ccgtggaccc cctgatgcgc 1980 
ctctacggca tgtccgccca taggctctcc 2040 
gtggccttta tagagcgcta cttccaaagc 2100 
accctggagg aggggaggaa gcggggctac 2160 
gtgcccgacc tcaacgcccg ggtgaagagc 222 0 
aacatgcccg tccagggcac cgccgccgac 2280 
ccccgcctcc gggagatggg ggcccgcatg 234 0 
gaggcccccc aagcgcgggc cgaggaggtg 24 00 
gcctatcccc tcgccgtgcc cctggaggtg 2460 
gccaagggt 2499 



209 



<210> 95 
<211> 833 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 95 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 



210 



Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



211 



Gly Asp Glu Asn Leu lie Arg 
625 630 



Val Phe Gin Glu Gly Lys Asp lie His 
635 640 



Thr Gin Thr Ala Ser Trp Met 
645 



Phe Gly Val Pro Pro Glu Ala Val Asp 
650 655 



Pro Leu Met Arg Arg Ala Ala 
660 



Lys Thr Val Asn Phe Gly Val Leu Tyr 
665 670 



Gly Met Ser Ala His Arg Leu 
675 



Ser Gin Glu Leu Ala lie Pro Tyr Glu 
680 685 



Glu Ala Val Ala Phe lie Glu 
690 695 



Arg Tyr Phe Gin Ser Phe Pro Lys Val 
700 



Arg Ala Trp lie Glu Lys Thr 
705 710 



Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
715 720 



Val Glu Thr Leu Phe Gly Arg 
725 



Arg Arg Tyr Val Pro Asp Leu Asn Ala 
730 735 



Arg Val Lys Ser Val Arg Glu 
740 



Ala Ala Glu Arg Met Ala Phe Asn Met 
745 750 



Pro Val Gin Gly Thr Ala Ala 
755 



Asp Leu Met Lys Leu Ala Met Val Lys 
760 765 



Leu Phe Pro Arg Leu Arg Glu 
770 775 



Met Gly Ala Arg Met Leu Leu Gin Val 
780 



His Asp Glu Leu Leu Leu Glu 
785 790 



Ala Pro Gin Ala Arg Ala Glu Glu Val 
795 800 



Ala Ala Leu Ala Lys Glu Ala 
805 



Met Glu Lys Ala Tyr Pro Leu Ala Val 
810 815 



Pro Leu Glu Val Glu Val Gly 
820 



Met Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



Gly 



<210> 96 
<211> 2499 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 96 

atgaattcgg ggatgctgcc cctctttgag 
caccacctgg cctaccgcac cttccacgcc 
ccggtgcagg cggtctacgg cttcgccaag 
gacgcggtga tcgtggtctt tgacgccaag 
gggtacaagg cgggccgggc ccccacgccg 



cccaagggcc gggtcctcct ggtggacggc 60 
ctgaagggcc tcaccaccag ccggggggag 120 
agcctcctca aggccctcaa ggaggacggg 180 
gccccctcct tccgccacga ggcctacggg 24 0 
gaggactttc cccggcaact cgccctcatc 300 
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aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 



213 



gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 234 0 
ctccttcagg tccacgacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggag 2499 



<210> 97 
<211> 833 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 97 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 



214 



210 



215 



220 



Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 



He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 
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Thr Tyr Val Asp Pro Leu 
545 550 

Leu His Thr Arg Phe Asn 
565 

Ser Ser Asp Pro Asn Leu 
580 

Gin Arg lie Arg Arg Ala 
595 



Pro Ser Leu Val His Pro Arg Thr Gly Arg 
555 560 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
570 575 

Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
585 590 

Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
600 605 



Ala Leu Asp Tyr Ser Gin 
610 

Gly Asp Glu Asn Leu lie 
625 630 

Thr Glu Thr Ala Ser Trp 
645 

Pro Leu Met Arg Arg Ala 
660 

Gly Met Ser Ala His Arg 
675 

Glu Ala Gin Ala Phe He 
690 

Arg Ala Trp He Glu Lys 
705 710 

Val Glu Thr Leu Phe Gly 
725 

Arg Val Lys Ser Val Arg 
740 

Pro Val Gin Gly Thr Ala 
755 

Leu Phe Pro Arg Leu Glu 
770 



He Glu Leu Arg Val Leu Ala His Leu Ser 
615 620 

Arg Val Phe Gin Glu Gly Arg Asp He His 
635 640 

Met Phe Gly Val Pro Arg Glu Ala Val Asp 
650 655 

Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
665 670 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
680 685 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
695 700 

Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
715 720 

Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
730 735 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
745 750 

Ala Asp Leu Met Lys Leu Ala Met Val Lys 
760 765 

Glu Met Gly Ala Arg Met Leu Leu Gin Val 
775 780 



His Asp Glu Leu Val Leu 
785 790 

Ala Arg Leu Ala Lys Glu 
805 

Pro Leu Glu Val Glu Val 
820 



Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
795 800 

Val Met Glu Gly Val Tyr Pro Leu Ala Val 
810 815 

Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



Glu 



<210> 98 

<211> 2499 

<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: Synthetic 



<400> 98 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


b u 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


ucaccaccag 


/"t /~*r /~*T T> f~*C 

ccggggggag 


1 on 
1ZU 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


1 on 
loU 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccct cct 


tccgccacga 


ggcct acggg 




gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


z> U U 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


"3 (C n 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 




accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


a o n 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 




gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


zr n n 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


£ r f\ 
DDL) 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


a&ggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 
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aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tccacgacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggt 






2499 



<210> 99 
<211> 833 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 99 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 
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Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 



Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 
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His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 



Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 
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Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Gly 



<210> 100 

<211> 2499 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 100 
ataaat fcccrcr 


crcra t crc t cr c c 


cctct t tgag 


cccaagggcc 


gggtcctcct 


QcrtercraccrcfC 


60 


caccacctaa 

V** W >w W*. w ~n 


cctaccgcac 


cttccacgcc 


ctgaagggee 


tcaccaccag 


C cqcr qqaqaQ 


120 


p p crcr t crc a crcr 


raatctacaa 


cttcgccaag 


agcctcct ca 


aggccctcaa 


crcracrcra c qqq 
yy ^y y ^^yy y 


180 


era cac crcr taa 


tcgtggtctt 


tgacgccaag 


gccccctcct 


t ccgccacga 


acre ctaccr crcr 


240 


crcrcrt" ara ^5 crcr 
yyy t - a ^ aa yy 


pcrcrcfppcfcrcrp 


ppppa pop per 

\— Vw Vw^ Vw/ d W <w W 


era craaptt tp 


cccggcaact 


cgccctcatc 


300 


ctcty y ciy u u y y 




rjCrnPfP t" CTPjPPj 

yyyy^-^yy^y 


perppt" pcracrer 


t* pppcrcrcrpt a 


pcracrcrccrcjac 
^■■y ^*y ^ y y ^ 


360 


era pert" ppt~ crcr 


p p a crp p 1~ crcrp 


p a a era a ctctp cr 


era aa acr era crcr 


crp t* a paacrcrt" 


ccgcatcct c 


420 


acccrcccraca 


aagaccttta 


ccagct cct t 


t ccgaccgca 


tccacgtcct 


ccaccccgag 


480 


crcrcrt~ a rrtra 


1" p a p p p cctac* 


pi"Cfcrpi~ t" t" crcr 
<~ *-yy *~ *- <~ *-yy 


era aa acr t" a per 


crp p t era crcr p p 


p era p p acr t era 


540 


pyppcrapt" arr 


crercrppci~crac 


c crcr crcr a p cr a cr 
yyyy y y 


tpccrapaacc 


fc t c c ccrcicrcr t 


c a_ a. crcr cr c 3l t c 


600 


cr crcr era ctP5 ^ era 


pcrcrpcra crcra a 
y y y yy 


crp i~ 1 - pt" crcra cr 


era cr1~ crcrcrcrcra 
yciy uyyyyya. 


rrpp ■(- crcra a ctp 
y y ^ a y ^ 


rrhrrtraaa 

^ V—^ l^f V «v \^ tA 


660 


aacctggacc 


ggctgaagee 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgege 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggecttte 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


aegagttegg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcett 


cgtgggcttt 


gtgctttccc 


geaaggagee 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


egagecttat 


1020 


aaagccctca 


gggacctgaa 


ggaggegegg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gecctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


getaeggegg 


ggagtggacg 


1200 


gaggaggegg 


gggageggge 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


tacegggagg 


tggagaggee 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


caegggggtg 


cgcctggacg 


tggectatet 


cagggecttg 


1380 
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tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacgacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggag 






2499 



<210> 101 
<211> 833 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 101 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 
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Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 



223 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 
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Pro Val Gin Gly Thr Ala Ala Asp 
755 760 

Leu Phe Pro Arg Leu Glu Glu Met 
770 775 

His Asp Glu Leu Val Leu Glu Ala 
785 790 

Ala Arg Leu Ala Lys Glu Val Met 
805 

Pro Leu Glu Val Glu Val Gly He 
820 

Glu 



Leu Met Lys Leu Ala Met Val Lys 
765 

Gly Ala Arg Met Leu Leu Gin Val 
780 

Pro Lys Glu Arg Ala Glu Ala Val 
795 800 

Glu Gly Val Tyr Pro Leu Ala Val 
810 815 

Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



<210> 102 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 102 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 
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aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


ggcaccgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 103 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 103 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 
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Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 



Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 



227 



Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly His Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 
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Glu Ala Gin Ala Phe 
690 

Arg Ala Trp lie Glu 
705 

Val Glu Thr Leu Phe 
725 

Arg Val Lys Ser Val 
740 

Pro Val Gin Gly Thr 
755 

Leu Phe Pro Arg Leu 
770 

His Asn Glu Leu Val 
785 

Ala Arg Leu Ala Lys 
805 

Pro Leu Glu Val Glu 
820 

Glu His His His His 
835 



<210> 104 
<211> 27 
<212> DNA 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 104 

gaggaggcgg ggcaccgggc cgccctt 27 



<210> 105 
<2U> 2517 
<212> DNA 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 105 

atgaattcgg ggatgctgcc cctctttgag 
caccacctgg cctaccgcac cttccacgcc 
ccggtgcagg cggtctacgg cttcgccaag 
gacgcggtga tcgtggtctt tgacgccaag 
gggtacaagg cgggccgggc ccccacgccg 



lie Glu Arg Tyr Phe 
695 

Lys Thr Leu Glu Glu 
710 

Gly Arg Arg Arg Tyr 
730 

Arg Glu Ala Ala Glu 
745 

Ala Ala Asp Leu Met 
760 

Glu Glu Met Gly Ala 
775 

Leu Glu Ala Pro Lys 
790 

Glu Val Met Glu Gly 
810 

Val Gly He Gly Glu 
825 

His His 



Gin Ser Phe Pro Lys Val 
700 

Gly Arg Arg Arg Gly Tyr 
715 720 

Val Pro Asp Leu Glu Ala 
735 

Arg Met Ala Phe Asn Met 
750 

Lys Leu Ala Met Val Lys 
765 

Arg Met Leu Leu Gin Val 
780 

Glu Arg Ala Glu Ala Val 
795 800 

Val Tyr Pro Leu Ala Val 
815 

Asp Trp Leu Ser Ala Lys 
830 



cccaagggcc gggtcctcct ggtggacggc 60 
ctgaagggcc tcaccaccag ccggggggag 120 
agcctcctca aggccctcaa ggaggacggg 180 
gccccctcct tccgccacga ggcctacggg 240 
gaggactttc cccggcaact cgccctcatc 300 
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aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctcc 


atcggaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


c gagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 



230 



gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<210> 106 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 106 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
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210 



215 



220 



Leu Lys Pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 



He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 
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Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 107 
<211> 36 
<212> DNA 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 107 

ctttccgaga ggctccatcg gaacctgtgg gggagg 3 6 

<210> 108 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 108 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 60 0 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 78 0 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 0 0 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 12 60 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 
gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 13 8 0 
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tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 
cttcccgcct tggggaagac gcaaaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 
gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 162 0 
aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 
aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 109 
<211> 839 
<212> PRT 

<213> Artificial Sequence 

<220> , , 

<223> Description of Artificial Sequence: Synthetic 

<400> 109 n TT _ _ 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 
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Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 
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Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 
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36 



Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 110 
<211> 36 
<212> DNA 

<213> Artificial Sequence 

<220> ^ , 

<223> Description of Artificial Sequence: Synthetic 

<400> 110 

ctcttcgcca acctgcttaa gaggcttgag ggggag 

<210> 111 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 

<220> ^ , 

<223> Description of Artificial Sequence: Synthetic 

<400> 111 ^ ^ n 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
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aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 
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<210> 112 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 112 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 
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Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 
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Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 



<210> 113 
<211> 27 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 113 

aggccccttt cccgggtcct ggcccat 



<210> 114 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
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<400> 114 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


c g a gg c gg ac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgccgggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 
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gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cqa.qqa.qqcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 115 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 115 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 
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Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 
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Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arq Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arq Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arq Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 
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Glu His His His His His His 
835 



<210> 116 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 116 

acgggggtgc gccgggacgt ggcctat 

<210> 117 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 117 _ n 
atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6U 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 36 0 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 6 00 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 102 0 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 10 8 0 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 
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ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct ccaggccttg 1380 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 
cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 118 
<211> 839 
<212> PRT 

<213> Artificial Sequence 

<220> ^ . 

<223> Description of Artificial Sequence: Synthetic 

<400> 118 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 
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Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Glv Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 
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Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 
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Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 119 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 119 

gtggcctatc tccaggcctt gtccctg 27 

<210> 120 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 120 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 18 0 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
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accgccgaca aagaccttta ccagctcctt 
gggtacctca tcaccccggc ctggctttgg 
gccgactacc gggccctgac cggggacgag 
ggggagaaga cggcgaggaa gcttctggag 
aacctggacc ggctgaagcc cgccatccgg 
aagctctcct gggacctggc caaggtgcgc 
aaaaggcggg agcccgaccg ggagaggctt 
agcctcctcc acgagttcgg ccttctggaa 
cccccgccgg aaggggcctt cgtgggcttt 
gatcttctgg ccctggccgc cgccaggggc 
gcggggctaa aggacctcaa ggaggtccgg 
gcctcgaggg aggggctaga cctcgtgccc 
ctggaccctt cgaacaccac ccccgagggg 
gaggacgccg cccaccgggc cctcctctcg 
ctcgaggggg aggagaagct cctttggctc 
gtcctggccc atatggaggc cacgggggtg 
tccctggagc ttgccgagga gatcgcccgc 
caccccttca acctcaactc ccgggaccag 
cttcccgcca tcggcaagac ggagaagacc 
gaggccctcc gcgaggccca ccccatcgtg 
aagctgaaga gcacctacat tgaccccttg 
ctccacaccc gcttcaacca gacggccacg 
aacctccaga acatccccgt ccgcaccccg 
gccgaggagg ggtggctatt ggtggccctg 
gcccacctct ccggcgacga gaacctgatc 
acggagaccg ccagctggat gttcggcgtc 
cgggcggcca agaccatcaa cttcggggtc 
caggagctag ccatccctta cgaggaggcc 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccgccgctac 
gtgcgggagg cggccgagcg catggccttc 



tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


tccgacaacc 


ttcccggggt 


caagggcatc 


/"" A A 

600 


gagtggggga 


gcctggaagc 


cctcctcaag 


f f A 

660 


gagaagatcc 


tggcccacat 


ggacgatctg 


no c\ 
i Z\J 


accgacctgc 


ccctggaggt 


ggac 1 1 cgcc 


i q r\ 
/ o U 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


cgcctggacg 


tggcctatct 


cagggccttg 


13 8 0 


ctcgaggccg 


aggtcttccg 


cctggccggc 


144 0 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


n CCA 

1560 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 
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ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 24 00 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 121 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 121 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 
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Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 
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Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 122 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 122 

ttgtccctgg agcttgccga ggagatc 



27 



255 



<210> 123 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 123 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


c £T99Ccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


W accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


LP gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


, 1 gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


U ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


"j» aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


s aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


1-/5 aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


l z agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccc tggagga 


ggccccctgg 


900 


Q cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatccgccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 
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gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 216 0 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 124 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 124 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 i5 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 SO 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 125 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 125 

gccgaggaga tccgccgcct cgaggcc 2 7 



<210> 126 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 126 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 
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agcctcctcc acgagttcgg ccttctggaa 
cccccgccgg aaggggcctt cgtgggcttt 
gatcttctgg ccctggccgc cgccaggggc 
gcggggctaa aggacctcaa ggaggtccgg 
gcctcgaggg aggggctaga cctcgtgccc 
ctggaccctt cgaacaccac ccccgagggg 
gaggacgccg cccaccgggc cctcctctcg 
ctcgaggggg aggagaagct cctttggctc 
gtcctggccc atatggaggc cacgggggtg 
tccctggagg tggccgagga gatcgcccgc 
caccccttca acctcaactc ccgggaccag 
cttcccgcca tcggcaagac ggagaagacc 
gaggccctcc gcgaggccca ccccatcgtg 
aagctgaaga gcacctacat tgaccccttg 
ctccacaccc gcttcaacca gacggccacg 
aacctccaga acatccccgt ccgcaccccg 
gccgaggagg ggtggctatt ggtggccctg 
gcccacctct ccggcgacga gaacctgatc 
acggagaccg ccagctggat gttcggcgtc 
cgggcggcca agaccatcaa cttcggggtc 
caggagctag ccatccctta cgaggaggcc 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccgccgctac 
gtgcgggagg cggccgagcg catggccttc 
ctcatgaagc tggctatggt gaagctcttc 
ctccttcagg tccacaacga gctggtcctc 
gcccggctgg ccaaggaggt catggagggg 
gaggtgggga taggggagga ctggctctcc 

<210> 127 

<211> 839 

<212> PRT 

<213> Artificial Sequence 



agccccaagg 


ccctggagga 


ggccccctgg 


q n d 

yuu 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


y o u 


ggccgcgtgc 


accgggcagc 


agaccccttg 


t a *■> a 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


t a o a 




ccatgctcct 


cgcc tacct c 


i t a r\ 
1.-L4 U 


gtggcgcggc 


gctacggggg 


ggagtggacg 


i onn 


gagaggctcc 


atcggaacct 


ccttaagcgc 


IZo U 


taccacgagg 


tggaaaagcc 


cctctcccgg 


*1 "3 O A 


cgcctggacg 


tggcctatct 


cagggccttg 


1 O O A 

13 80 


ctcgaggagg 


aggtcttccg 


cctggccggc 


T A A A 
±440 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1 [T A A 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


IbbU 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1 f~ O A 


ccggacctca 


tccaccccag 


gacgggccgc 


icon 
loo U 


gccacgggca 


ggctaagt ag 


ctccgatccc 


1 ""7 A A 


cttgggcaga 


ggatccgccg 


ggccttcatc 


*1 O A A 

XoU 0 


gactatagcc 


agatagagct 


cagggtgctg 


T O f A 

1860 


cgggtcttcc 


aggaggggcg 


ggacatccac 


"! Q '") A 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


T Q O A 

ly 80 


ctctacggca 


tgtcggccca 


ccgcctctcc 


O A A A 
U4 0 


caggccttca 


ttgagcgcta 


ctttcagagc 


1 1 A A 

zlOO 


accctggagg 


agggcaggag 


gcgggggtac 


^160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacatgcccg 


tccagggcac 


cgccgccgac 




cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 127 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
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305 

Asp Leu Leu 

Ala Asp Pro 

Leu Ala Lys 
355 

Val Pro Gly 
370 

Asn Thr Thr 
385 

Glu Asp Ala 
Leu Leu Lys 



Ala Leu 
325 

Leu Ala 
340 

Asp Leu 
Asp Asp 
Pro Glu 



Ala His 
405 

Arg Leu 
420 



310 315 

Ala Ala Ala Arg Gly Gly Arg 
330 

Gly Leu Lys Asp Leu Lys Glu 
345 

Ala Val Leu Ala Ser Arg Glu 
360 

Pro Met Leu Leu Ala Tyr Leu 
375 380 

Gly Val Ala Arg Arg Tyr Gly 
390 395 

Arg Ala Leu Leu Ser Glu Arg 
410 

Glu Gly Glu Glu Lys Leu Leu 
425 



320 

Val His Arg Ala 
335 

Val Arg Gly Leu 
350 

Gly Leu Asp Leu 
365 

Leu Asp Pro Ser 



Gly Glu Trp Thr 
400 

Leu His Arg Asn 
415 

Trp Leu Tyr His 
430 



Glu Val Glu 
435 

Gly Val Arg 
450 

Ala Glu Glu 
465 

His Pro Phe 

Asp Glu Leu 

Arg Ser Thr 
515 

He Val Glu 
530 

Thr Tyr He 
545 

Leu His Thr 
Ser Ser Asp 



Gin Arg He 
595 

Ala Leu Asp 
610 

Gly Asp Glu 
625 



Lys Pro Leu Ser Arg Val Leu Ala His 
440 



Leu Asp Val Ala Tyr Leu Arg 
455 

He Ala Arg Leu Glu Glu Glu 
470 

Asn Leu Asn Ser Arg Asp Gin 
485 490 

Gly Leu Pro Ala He Gly Lys 
500 505 

Ser Ala Ala Val Leu Glu Ala 
520 

Lys He Leu Gin Tyr Arg Glu 
535 

Asp Pro Leu Pro Asp Leu He 
550 

Arg Phe Asn Gin Thr Ala Thr 
565 570 

Pro Asn Leu Gin Asn He Pro 
580 585 

Arg Arg Ala Phe He Ala Glu 
600 

Tyr Ser Gin He Glu Leu Arg 
615 

Asn Leu He Arg Val Phe Gin 
630 



Ala Leu 
460 

Val Phe 
475 

Leu Glu 

Thr Glu 

Leu Arg 

Leu Thr 
540 

His Pro 
555 

Ala Thr 
Val Arg 
Glu Gly 



Val Leu 
620 

Glu Gly 
635 



Met Glu Ala Thr 
445 

Ser Leu Glu Val 



Arg Leu Ala Gly 
480 

Arg Val Leu Phe 
495 

Lys Thr Gly Lys 
510 

Glu Ala His Pro 
525 

Lys Leu Lys Ser 



Arg Thr Gly Arg 
560 

Gly Arg Leu Ser 
575 

Thr Pro Leu Gly 
590 

Trp Leu Leu Val 
605 

Ala His Leu Ser 



Arg Asp He His 
640 
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Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arq Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arq Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 128 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> , 
<223> Description of Artificial Sequence: Synthetic 

<400> 128 

gcccgcctcg aggaggaggt cttccgc 



<210> 129 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 129 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
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caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaagg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 
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acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cg a 99 a ggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


a ggg ca gg a g 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 130 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 130 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Glv Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 
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Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 
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<210> 131 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 131 

tttgacgagc taaggcttcc cgccatc 27 

<210> 132 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 132 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 30 0 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 102 0 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 120 0 
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gaggacgccg 


cccaccgggc 


cctcctctcg 


gagag gc t cc 


cLLLy ya-dL-L L 


ol. l ucxciy L-y l- 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


caccacgagg 


Lygaaa.a.gLL 


ll- u L L^L-^yy 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcc uggacg 


t*ggLLLci it* l 


uay yy l»l» l *—y 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


agg l. l l. l LL.y 


r* p 1 t" an pp crcr r 1 

lu Lyy LLyy l 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 




cgagcuagyy 


1 ROD 

JL »3 V \J 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


/— i /~i /~i ji-* 4— /— < 4— 

cyccy tccty 


1 jDU 


gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg 


ggagc. llcxl,l 




aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


LLLaLL.L.Lciy 


ycLLyyy LLy l 


1680 


ctccacaccc 


gcttcaacca 


cracQQC c acq 


qccacqqqca 


y y L- l day l ciy 


/-t/-i na t" p* p 1 r* 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttqqqcaqa 


y y a. l L*Ly L»Ly 


nrrp r* "h I - P 1 p. t~ p* 

yML-^LUUCLL^ 


1800 


gccgaggagg 


ggtggctatt 


acrtQcrccctcr 


gactatagcc 


\ -a 4— -3 /"fa rr^" 1 

ayatdydyL. L 


^^yy y L y L y 


1860 


gcccacctct 


c cggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


yyaL.dLLL.au 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccguggaccc 


cccyatyLyL 




cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tyucyyCCCd 


/-I /-* ft f~* f~ f~* \~ f~* 
LLytLLLLLL 


Zi \J t: VJ 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


tcgagcgcua 


/— 1 4— 4- 4- y— i a rta {~tf~* 

ct LLLdydyL 


91 DO 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcggggg^ac 


?i fin 

Z> X O \J 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tzagaggcccg 


ggtgaagagc 


9 9 9 0 
zzzu 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


LCcaggyLcAL 


oy LLy LLy o.l 


22 80 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 133 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 133 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 
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Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 
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Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 134 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 134 

atcgccaaga cgcaaaagac cggcaag 

<210> 135 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 135 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 18 0 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
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gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 60 0 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 7 80 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 102 0 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 12 0 0 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 132 0 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 
cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagcaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 180 0 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 198 0 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 04 0 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
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ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 136 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 136 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
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245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 
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Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 137 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 137 

aagatcctgc agcaccggga gctcacc 27 
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<210> 138 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



,/j n TOO 

<4 U U > Ijo 

atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cgggg a cgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


gg a ggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 
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gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


162 0 


aagctgaaga 


acacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 139 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 139 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 



Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 140 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 140 

accaagctga agaacaccta cattgac 27 



<210> 141 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 141 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 
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agcctcctcc acgagttcgg ccttctggaa 
cccccgccgg aaggggcctt cgtgggcttt 
gatcttctgg ccctggccgc cgccaggggc 
gcggggctaa aggacctcaa ggaggtccgg 
gcctcgaggg aggggctaga cctcgtgccc 
ctggaccctt cgaacaccac ccccgagggg 
gaggacgccg cccaccgggc cctcctctcg 
ctcgaggggg aggagaagct cctttggctc 
gtcctggccc atatggaggc cacgggggtg 
tccctggagg tggccgagga gatcgcccgc 
caccccttca acctcaactc ccgggaccag 
cttcccgcca tcggcaagac ggagaagacc 
gaggccctcc gcgaggccca ccccatcgtg 
aagctgaaga gcacctacgt ggaccccttg 
ctccacaccc gcttcaacca gacggccacg 
aacctccaga acatccccgt ccgcaccccg 
gccgaggagg ggtggctatt ggtggccctg 
gcccacctct ccggcgacga gaacctgatc 
acggagaccg ccagctggat gttcggcgtc 
cgggcggcca agaccatcaa cttcggggtc 
caggagctag ccatccctta cgaggaggcc 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccgccgctac 
gtgcgggagg cggccgagcg catggccttc 
ctcatgaagc tggctatggt gaagctcttc 
ctccttcagg tccacaacga gctggtcctc 
gcccggctgg ccaaggaggt catggagggg 
gaggtgggga taggggagga ctggctctcc 

<210> 142 
<211> 839 
<212> PRT 

<213> Artificial Sequence 



agccccaagg ccctggagga ggccccctgg 900 
gtgctttccc gcaaggagcc catgtgggcc 960 
ggccgcgtgc accgggcagc agaccccttg 102 0 
ggcctcctcg ccaaggacct cgccgtcttg 1080 
ggggacgacc ccatgctcct cgcctacctc 1140 
gtggcgcggc gctacggggg ggagtggacg 12 0 0 
gagaggctcc atcggaacct ccttaagcgc 1260 
taccacgagg tggaaaagcc cctctcccgg 1320 
cgcctggacg tggcctatct cagggccttg 1380 
ctcgaggccg aggtcttccg cctggccggc 1440 
ctggaaaggg tcctctttga cgagctaggg 1500 
ggcaagcgct ccaccagcgc cgccgtcctg 1560 
gagaagatcc tgcagtaccg ggagctcacc 162 0 
ccggacctca tccaccccag gacgggccgc 1680 
gccacgggca ggctaagtag ctccgatccc 174 0 
cttgggcaga ggatccgccg ggccttcatc 18 00 
gactatagcc agatagagct cagggtgctg 1860 
cgggtcttcc aggaggggcg ggacatccac 1920 
ccccgggagg ccgtggaccc cctgatgcgc 1980 
ctctacggca tgtcggccca ccgcctctcc 2 040 
caggccttca ttgagcgcta ctttcagagc 210 0 
accctggagg agggcaggag gcgggggtac 2160 
gtgccagacc tagaggcccg ggtgaagagc 222 0 
aacatgcccg tccagggcac cgccgccgac 22 80 
cccaggctgg aggaaatggg ggccaggatg 2340 
gaggccccaa aagagagggc ggaggccgtg 2400 
gtgtatcccc tggccgtgcc cctggaggtg 2460 
gccaaggagc accaccacca ccaccac 2517 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 142 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 



284 



305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 
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Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 143 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 143 

aagagcacct acgtggaccc cttgccg 27 



<210> 144 
<211> 2517 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 144 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
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caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 60 0 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 90 0 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 96 0 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 12 00 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 126 0 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 
cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccgagcctcg tccaccccag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 
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acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag 


accrcracrcrt ac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


crat* aaaaaac 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 




ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 145 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 145 

Met Asn Ser Gly Met Leu 
1 5 

Leu Val Asp Gly His His 
20 

Gly Leu Thr Thr Ser Arg 
35 

Ala Lys Ser Leu Leu Lys 
50 

Val Val Phe Asp Ala Lys 
65 70 

Gly Tyr Lys Ala Gly Arg 
85 

Leu Ala Leu lie Lys Glu 
100 

Glu Val Pro Gly Tyr Glu 
115 

Lys Ala Glu Lys Glu Gly 
130 

Asp Leu Tyr Gin Leu Leu 
145 150 

Gly Tyr Leu lie Thr Pro 
165 



Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
10 15 

Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
25 30 

Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
40 45 

Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
55 60 

Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
75 80 

Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
90 95 

Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
105 110 

Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
120 125 

Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 

135 140 

Ser Asp Arg lie His Val Leu His Pro Glu 
155 160 



Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 
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Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 
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<210> 146 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 146 

attgacccct tgccgagcct cgtccacccc aggacgggc 3 9 

<210> 147 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 147 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 60 0 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 96 0 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 10 8 0 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 120 0 
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gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 144 0 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 
cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 204 0 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 148 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 148 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 
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Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 



Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 
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Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 



Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 149 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 149 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 
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agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttgggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 150 

<211> 839 

<212> PRT 

<213> Artificial Sequence 



296 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 150 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 - 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
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305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 
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Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 151 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 151 

ttcgccaacc tgcttgggag gcttgagggg gag 33 



<210> 152 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 152 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
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caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtggaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggc cgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 
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acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<210> 153 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 153 

Met Asn Ser Gly Met Leu Pro Leu 
1 5 

Leu Val Asp Gly His His Leu Ala 
20 

Gly Leu Thr Thr Ser Arg Gly Glu 
35 40 

Ala Lys Ser Leu Leu Lys Ala Leu 
50 55 

Val Val Phe Asp Ala Lys Ala Pro 
65 70 

Gly Tyr Lys Ala Gly Arg Ala Pro 
85 

Leu Ala Leu lie Lys Glu Leu Val 
100 

Glu Val Pro Gly Tyr Glu Ala Asp 
115 120 

Lys Ala Glu Lys Glu Gly Tyr Glu 
130 135 

Asp Leu Tyr Gin Leu Leu Ser Asp 
145 150 

Gly Tyr Leu lie Thr Pro Ala Trp 
165 



Sequence: Synthetic 



Phe Glu Pro Lys Gly Arg Val Leu 
10 15 

Tyr Arg Thr Phe His Ala Leu Lys 
25 30 

Pro Val Gin Ala Val Tyr Gly Phe 
45 

Lys Glu Asp Gly Asp Ala Val lie 
60 

Ser Phe Arg His Glu Ala Tyr Gly 
75 80 

Thr Pro Glu Asp Phe Pro Arg Gin 
90 95 

Asp Leu Leu Gly Leu Ala Arg Leu 
105 110 

Asp Val Leu Ala Ser Leu Ala Lys 
125 

Val Arg lie Leu Thr Ala Asp Lys 
140 

Arg lie His Val Leu His Pro Glu 
155 160 

Leu Trp Glu Lys Tyr Gly Leu Arg 
170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 
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Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 
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<210> 154 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 154 

ctcttcgcca acctgtggaa gaggcttgag ggg 



<210> 155 
<211> 835 
<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Synthetic 
<400> 155 

Met Glu Phe Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
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210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Glu Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Pro Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Glu Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro lie Val Glu Leu lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 
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Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly lie Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp lie Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys lie Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asp Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 



<210> 156 

<211> 2526 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 156 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcagggacg 


gccgggtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 
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tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 186 0 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 192 0 
gacatcgcca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 210 0 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 
gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 60 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

2526 



caccac 



<210> 157 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 157 ^ 
Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 SO 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 
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Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 
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Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
62 5 630 635 640 

Asp He Ala Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 
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Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 158 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 158 

gcttgcggtc tgggtggcga tgtccttccc etc 33 



<210> 159 

<211> 2526 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 159 



Q atgaattccg 


aggegatget 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


^" ggccaccacc 


tggcctaccg 


caccttcttc 


gecctgaagg 


gcctcaccac 


gagcegggge 


120 


O gaaccggtgc 


aggeggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


# tacgaggect 


acaaggeggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


f*! ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


eggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


egaggtgege 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtaeggect 


caggceggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


eggggtcaag 


600 


ggcatcgggg 


agaagacege 


cctcaagctc 


ctcaaggagt 


ggggaagect 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


egggtgegea 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggeggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagtteggea 


gcctcctcca 


egagttegge 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggectte 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 
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atgtgggcgg 


agcttaaagc 


cctggccgcc 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gccgtcttgg 


cctcgaggga 


ggggctagac 


gcctacctcc 


tggacccttc 


gaacaccacc 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctctcccggg 


tcctggccca 


tatggaggcc 


caggcccttt 


ccctggagct 


tgcggaggag 


ttggcgggcc 


accccttcaa 


cctcaactcc 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


gcggtgctgg 


aggccctacg 


ggaggcccac 


gagctcacca 


agctcaagaa 


cacctacgtg 


acgggccgcc 


tccacacccg 


cttcaaccag 


tccgacccca 


acctgcagaa 


catccccgtc 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


gacatccaca 


cccagaccgc 


aagctggatg 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


aggctctccc 


aggagcttgc 


catcccctac 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


cggggctacg 


tggaaaccct 


cttcggaaga 


gtgaagagcg 


tcagggaggc 


cgcggaggcc 


gccgccgacc 


tcatgaagct 


cgccatggtg 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


gaggaggtgg 


cggctttggc 


caaggaggcc 


ctggaggtgg 


aggtggggat 


gggggaggac 


caccac 







<210> 160 

<211> 842 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
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tgcaggggcg gccgcgtgca ccgggcagca 1020 
g a gg tcc ggg gcctcctcgc caaggacctc 1080 
ctcgtgcccg gggacgaccc catgctcctc 1140 
cccgaggggg tggcgcggcg ctacgggggg 120 0 
ctcctctcgg agaggctcca tcggaacctc 1260 
ctttggctct accacgaggt ggaaaagccc 1320 
accggggtac ggcgggacgt ggcctacctt 1380 
atccgccgcc tcgaggagga ggtcttccgc 1440 
cgggaccagc tggaaagggt gctctttgac 1500 
caaaagacag gcaagcgctc caccagcgcc 1560 
cccatcgtgg agaagatcct ccagcaccgg 1620 
gaccccctcc caagcctcgt ccacccgagg 1680 
acggccacgg ccacggggag gcttagtagc 174 0 
cgcaccccct tgggccagag gatccgccgg 1800 
gtggccctgg actatagcca gatagagctc 1860 
aacctgatca gggtcttcca ggaggggaag 1920 
ttcggcgtcc ccccggaggc cgtggacccc 1980 
ttcggcgtcc tctacggcat gtccgcccat 2 040 
gaggaggcgg tggcctttat agagcgctac 2100 
atagaaaaga ccctggagga ggggaggaag 2160 
aggcgctacg tgcccgacct caacgcccgg 2 220 
atggccttca acatgcccgt ccagggcacc 2280 
aagctcttcc cccgcctccg ggagatgggg 2340 
ctcctcctgg aggcccccca agcgcgggcc 2400 
atggagaagg cctatcccct cgccgtgccc 2460 
tggctttccg ccaagggtca ccaccaccac 2520 

2526 



Synthetic 



<400> 160 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tvr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 HO 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arq Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
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325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 
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Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Ala Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 161 
<211> 33 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 161 

catgttgaag gccatggcct ccgcggcctc cct 3 3 



<210> 162 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 162 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 12 0 
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gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


ggg t acaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


eggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


egaggtgege 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtaeggect 


caggceggag 


540 


cagtgggtgg 


acttccgcgc 


c c t eg t gggg 


gacccctccg 


acaacctccc 


eggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagect 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


72 0 


gaagacctca 


ggctctcctt 


ggagctctcc 


egggtgegea 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


a 9999 c 999 a 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


egagttegge 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggectte 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtceggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


etaegggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


gg a g aa gctc 


ctttggctct 


accacgaggt 


ggaaaagece 


1320 


ctctcccggg 


tcctggccca 


tatggaggee 


aceggggtae 


99 c 999 ac 9t 


ggcctacctt 


13 80 


caggcccttt 


ccctggagct 


tgeggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ecaeggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggecagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatageca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


eggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 
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ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 040 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 234 0 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 240 0 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 60 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 
caccac 2526 



<210> 163 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 163 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 
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Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
3V0 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arq 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 
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Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 



Ser Ala Lys Gly His His His His His His 
835 840 
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<210> 164 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 164 

caggaggagc tcgttggcga cctggaggag 3 0 

<210> 165 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 165 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 
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gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


« 4— 4— 4— « 4— 

ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


13 8 0 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


192 0 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 
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<210> 166 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 166 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 
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Lys Gly 

Phe Ala 
50 

Val Phe 
65 

Tyr Glu 
Arg Gin 
Arg Leu 



Leu Thr 
35 



Thr Ser 



Lys Ser Leu Leu 
Val Val 
Ala Tyr 



Phe Asp 
70 



Leu Ala 
100 

Glu Val 
115 



Ala Lys 
130 



Lys Ala 
Asp Leu 
Pro Glu Gly His 



Asp Arg 
145 



Lys Ala 
85 

Leu lie 



Pro Gly 
Glu Lys 



Tyr Gin 
150 

Leu lie 
165 



Arg Gly 
40 

Lys Ala 
55 

Ala Lys 
Gly Arg 
Lys Glu 



Glu Pro Val Gin 
Leu Lys 
Ala Pro 



Glu Asp 
60 



Ser Phe 
75 



Ala Val Tyr Gly 
45 

Gly Tyr Lys Ala 



Arg His Glu Ala 
80 



Ala Pro 
90 

Leu Val 
105 



Thr Pro Glu Asp Phe Pro 
95 



Tyr Glu 
120 



Ala Asp 
Tyr Glu 
Leu Val Ser Asp 



Glu Gly 
135 



Thr Pro 



Leu Arg 
Ser Asp 



Pro Glu 
180 

Asn Leu 
195 



Lys Leu 
210 



Leu Lys 
Val Lys 
Glu Asp Leu Arg 



Gin Trp Val Asp 
Pro Gly 
Glu Trp 



Val Lys 
200 



Asp Arg 
225 



Pro Leu 

Leu Arg 

Phe Gly 
290 

Pro Pro 
305 

Met Trp 
His Arg 
Arg Gly 



Glu Val 
260 

Ala Phe 
275 



Pro Glu 
230 

Leu Ser 
245 

Asp Leu 
Leu Glu 



Leu Leu Glu Ala 



Gly Ser 
215 

Asn Val 
Leu Glu 
Ala Gin 



Arg Leu 
280 

Pro Ala 
295 



Glu Trp 
170 

Phe Arg 
185 

Gly lie 
Leu Glu 
Arg Glu 



Asp Leu 

Asp Val 

Val Arg 
140 

Arg Val 
155 

Leu Trp 
Ala Leu 
Gly Glu 



Asn Leu 
220 

Lys lie 
235 



Leu Ser 
250 



Arg Val 
Glu Pro 
Glu Phe Gly Ser 



Gly Arg 
265 



Pro Leu 



Glu Gly 
Ala Glu 



Ala Ala 
340 

Leu Leu 
355 



Ala Phe 
310 

Leu Lys 
325 



Val Gly Phe Val 



Glu Glu 
300 

Leu Ser 
315 



Ala Leu 



Ala Ala 
330 



Asp Pro Leu Ala 

Ala Lys Asp Leu 
360 



Cys Arg 
Lys Asp 
Ala Val Leu Ala 



Gly Leu 
345 



Leu Gly Phe Thr 
110 

Leu Ala Thr Leu 
125 

lie Leu Thr Ala 



Ala Val Leu His 
160 

Glu Lys Tyr Gly 
175 

Val Gly Asp Pro 
190 

Lys Thr Ala Leu 
205 

Leu Lys Asn Leu 



Lys Ala His Leu 
240 

Arg Thr Asp Leu 
255 

Asp Arg Glu Gly 
270 

Leu Leu His Glu 
285 

Ala Pro Trp Pro 



Arg Pro Glu Pro 
320 

Gly Gly Arg Val 
335 

Leu Lys Glu Val 
350 

Ser Arg Glu Gly 
365 
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Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 
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Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 167 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 167 

ggagcgcttg cctgtcttct tcgtcttctt caaggcggga ggcct 45 

<210> 168 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 168 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
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a r* crrr* c^rra p<a 
dLLyLLydLd 




f~i s-i -~i /-"t/— i 4- e-» /-i 4- 4- 

CCdgCLCCtL 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


cincri~ a c t (~ ,J r p 1 a 

y y y ^-dLL. 


pa p" 1 p* 1 P 1 (~t (~if~*(-i 
LLdLLLLyyL 


ctggcxuugg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


y L-^y O.L- UdLL 


rrrrrfppp I - py a p* 

yyyLC-oLyaL 


c yyyy ac y a 9 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


rrprprpra pra 3 i^a 
y y y y ay ddy d 


c y9 c y a yyaa 


gcttctggag 


gagtggggga 


gectggaage 


cctcctcaag 


660 


aaLL uy y dL l 


(Trfpfrra appp 

yy ocyaayce 


/— ^ /—i /~i -~i *f— f*rs~*r 

LyCCaCCCyy 


gagaaga t c c 


tggcccacat 


ggacgatctg 


720 


aay l HjOLuI- 


ggydcctggc 


caaggtgege 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


dddcty y Ly y y 


dyLCCydCCy 


ggagaggct u 


agggecttte 


tggagaggct 


tgagtttggc 


840 


a/T^/" 1 "} - P" 1 P 1 "h P 1 p" 1 


aegagcuegg 


cccuc eggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


PIplptp^plp-fpiplpTPY 


day y y gc cut 


cgtgggcttt 


gtgctttccc 


geaaggagee 


catgtgggcc 


960 


na f - p-«4- t" p** 1~ ptpt 
yciu^ L- u l Ly y 


P* P" 1 P" 1 4- prprp /-i /— r /-I 

LLL uyyecy c 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


egagecttat 


1020 


d d dy LuLLUd 


gyy accugaa 


99 a 99cgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


pfj-ip/r(- /— r -21 pTprpf 

yocc^uydygy 


aaggccutgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


✓ 1 4— {-*ti~r i r~\ r~t /*-*. 4— 4- 

0 LyydCCCLt. 


cgaacaccac 


ccccgagggg 


gtggcccggc 


getaeggegg 


99 a 9tggacg 


1200 


gaggaggcgg 


999 a 9 c 999c 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


p* t~ t* pra PTPrrrrTPT 

c u tydy gggg 


a yy a y a y9 ct 


cct u uggece 


tacegggagg 


tggagaggee 


cctttccgct 


1320 


y l l l L-yy 


af- af- ptpt a prpip 

dtdLyyaggc 


caegggggug 


cgcctggacg 


tggectatet 


cagggecttg 


1380 


lllc i_y y dy y 


i-ggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


L aLL LL- L. L>Ld 


ap i r , *'} - P i aap H T~p i 

dLL LLddL L L 


/~i T> /"i /~i ^ /-t 

ccggyaccag 


ctggaaaggg 


ccctcEt tga 


cgagctaggg 


1500 


L L LLLLMLL d 


4- p^pTPfpi a a rra /~> 

LLyyLddy at- 


ycdaaagacc 


ggcaageget 


ccaccagcgc 


cgccgtcctg 


1560 


pra rtrfp ^ /-» 4— c~i /— < 
y dy yOOOLLC 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aarrpf - na a pt a 
day l. i_y day d 


gCdCCUdCdt 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 




y L L. LtddLLd 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


ddL L ILL dy a 


dCdUCCCCgc 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


y L^y dy gayy 


/-fiT 4— /~t s~\ 4^ — * 4— 4— 

ygtggcuacu 


ggtggccctg 


gaetatagee 


agatagagct 


cagggtgctg 


1860 


etc* ppappf 4— 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


aLyydydCcy 


ccagcTzggac 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


p« prrrrfprrrrp p« a 

^--yyy ^yy^ca 


a /T a ^ /~i -a 4- y-t -n 

dydCCdtCaa 


etteggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


egaggaggee 


caggccttca 


ttgagegcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gegggggtae 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggeccg 


ggtgaagagc 


2220 


gtgegggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 
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ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2 517 

<210> 169 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 169 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arq Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
H5 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arq 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 
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Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 
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Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 



Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 170 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 170 

gaggaccagc tcgttggcga cctgaaggag cat 
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<210> 171 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 171 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tqgagaqqct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaqgaqcc 


catqtqqqcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 
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gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatcgcc 192 0 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 204 0 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 210 0 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 240 0 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 172 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 172 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie Ala 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 173 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 173 

gaggggcggg acatcgccac ggagaccgcc age 33 



<210> 174 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 174 
atgaattegg 


ggatgetgee 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggee 


tcaccaccag 


ceggggggag 


120 


ccggtgcagg 


eggtctaegg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


egaggeggae 


360 


gaegtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


eggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


eggegaggaa 


gcttctggag 


gagtggggga 


gectggaage 


cctcctcaag 


660 


aacctggacc 


ggctgaagee 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgege 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggecttte 


tggagaggct 


tgagtttggc 


840 
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agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


cccuggagga 


ggccccc ugg 




cccccgccgg 


aaggggccuu 


cgcgggcuu u 


/■i 1 it ft 4— 4— 4- /"~« /~i /-t 

gtycttL-ccc 


gc aagy ay c c 


oatyi-yyyce 




gatcttctgg 


ccctggccgc 


cgc c aggggc 


ggccgcgtcc 


accgggcccc 


egagee L.uac 




aaagccctca 


gggacctgaa 


ggaggegegg 


gggc u u c u eg 


ccaaagacc u 


gagegtec ug 


i n r n 

1UOU 


gccctgaggg 


aaggccttgg 


cct cccgccc 


ggcgacgacc 


ccatgctcc t 


/*-| +-* ' — \ /"I /~1 +— 

CyCCLdCCLC 




ctggaccctt 


cgaacaccac 


ccccgagggg 


guggc cegge 


gc caeggegg 


ggagcggacg 


i 9 n n 

1ZUU 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggc t c t 


t cgccaacct 


get taagagg 


X.A O U 


cttgaggggg 


aggagaggct 


cct t tggct t 


t acegggagg 


tggagaggee 


cc tt t ccget 


1 U 


gtcctggccc 


atatggaggc 


caegggggtg 


cgcctggacg 


tggectat ct 


cagggee ttg 


1 j oU 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggt ct t ccg 


cctggccggc 




caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


t cct ctt tga 


cgagctaggg 




cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaageget 


ccaccagcgc 


cgccgtcctg 


Ibo U 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


lo^ U 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


g&cgggccgc 


icon 
Ibo U 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


i 7/1 n 

1 /4 U 


aacctccaga 


acatccccgt 


cgccaccccg 


cttgggcaga 


ggatccgccg 


ggcett cat c 


loUU 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gaetatagee 


agauagagcu. 


cagggege ug 


loo U 


gcccacctct 


ccggcgacga 


gaacctgatc 


egggt ctt cc 


aggaggggcg 


ggacat ccac 


i q o n 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


iy ou 


cgggcggcca 


agaccatcaa 


etteggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


ZU4U 


caggagctag 


ccatccctta 


c 9 a 99 a 99 cc 


caggccttca 


ttgagegcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


9 c 99999 t a c 




gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggeccg 


ggtgaagagc 


ZZZ U 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


O O Q A 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


gga-ggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gecaaggage 


accaccacca 


ccaccac 


2517 



<210> 175 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 175 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
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305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Ala Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 



336 



Thr Glu Thr Ala Ser 
645 

Pro Leu Met Arg Arg 
660 

Gly Met Ser Ala His 
675 

Glu Ala Gin Ala Phe 
690 

Arg Ala Trp lie Glu 
705 

Val Glu Thr Leu Phe 
725 

Arg Val Lys Ser Val 
740 

Pro Val Gin Gly Thr 
755 

Leu Phe Pro Arg Leu 

770 

Ala Asn Glu Leu Val 
785 

Ala Arg Leu Ala Lys 
805 

Pro Leu Glu Val Glu 
820 

Glu His His His His 
835 



<210> 176 

<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 176 

cagaacatcc ccgtcgccac cccgcttggg cag 3 3 



<210> 177 
<211> 2517 
<212> DNA 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 177 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 



Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
650 655 

Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
665 670 

Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
680 685 

lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
695 700 

Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
710 715 720 

Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
730 735 

Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
745 750 

Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
760 765 

Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
775 780 

Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
790 795 800 

Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
810 815 

Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



His His 
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caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcaagaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 
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acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 04 0 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 210 0 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 24 00 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 178 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 178 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
!95 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arq 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trn Thr 
385 390 395 y 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arcf 
420 4 25 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys Thr Gly Lys 
500 505 510 
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Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 
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<210> 179 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 179 

gggcttcccg ccatcaagaa gacggagaag acc 33 

<210> 180 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 180 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 24 0 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 6 00 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 10 8 0 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 
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gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 12 60 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 8 0 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 144 0 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 150 0 
cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 181 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 181 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 
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Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 no 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arcr 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 
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Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg 
725 

Arg Val Lys Ser Val Arg Glu 
740 

Pro Val Gin Gly Thr Ala Ala 
755 

Leu Phe Pro Arg Leu Glu Glu 
770 775 

Ala Asn Glu Leu Val Leu Glu 
785 790 

Ala Arg Leu Ala Lys Glu Val 
805 

Pro Leu Glu Val Glu Val Gly 
820 

Glu His His His His His His 
835 



<210> 182 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 182 

ctagggcttc ccgccatcaa gaagacgcaa aagaccggc 39 

<210> 183 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 183 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 18 0 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc cgagacggag gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 



Arg Arg Tyr Val 
730 

Ala Ala Glu Arg 
745 

Asp Leu Met Lys 
760 

Met Gly Ala Arg 



Ala Pro Lys Glu 
795 

Met Glu Gly Val 
810 

lie Gly Glu Asp 
825 



Pro Asp Leu Glu Ala 
735 

Met Ala Phe Asn Met 
750 

Leu Ala Met Val Lys 
765 

Met Leu Leu Gin Val 
780 

Arg Ala Glu Ala Val 
800 

Tyr Pro Leu Ala Val 
815 

Trp Leu Ser Ala Lys 
830 
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tcttcggccg 


ccgccgctac 


gtgcgggagg 


cggccgagcg 


catggccttc 


ctcatgaagc 


tggctatggt 


gaagctcttc 



gaaaagtacg gcctgaggcc cgaccagtgg 540 
tccgacaacc ttcccggggt caagggcatc 6 00 
9 a 9tggggga gcctggaagc cctcctcaag 660 
gagaagatcc tggcccacat ggacgatctg 72 0 
accgacctgc ccctggaggt ggacttcgcc 7 80 
agggcctttc tggagaggct tgagtttggc 840 
agccccaagg ccctggagga ggccccctgg 90 0 
gtgctttccc gcaaggagcc catgtgggcc 960 
ggccgcgtcc accgggcccc cgagccttat 102 0 
gggcttctcg ccaaagacct gagcgttctg 1080 
ggcgacgacc ccatgctcct cgcctacctc 1140 
gtggcccggc gctacggcgg ggagtggacg 12 0 0 
gagaggctct tcgccaacct gcttaagagg 12 60 
taccgggagg tggagaggcc cctttccgct 132 0 
cgcctggacg tggcctatct cagggccttg 1380 
ctcgaggccg aggtcttccg cctggccggc 1440 
ctggaaaggg tcctctttga cgagctaggg 150 0 
ggcaagcgct ccaccagcgc cgccgtcctg 1560 
gagaagatcc tgcagtaccg ggagctcacc 162 0 
ccggacctca tccaccccag gacgggccgc 1680 
gccacgggca ggctaagtag ctccgatccc 174 0 
ctt 999C a 9 a ggatccgccg ggccttcatc 1800 
gactatagcc agatagagct cagggtgctg 1860 
cgggtcttcc aggaggggcg ggacatccac 192 0 
ccccgggagg ccgtggaccc cctgatgcgc 1980 
ctctacggca tgtcggccca ccgcctctcc 2040 
caggccttca ttgagcgcta ctttcagagc 2100 
accctggagg agggcaggag gcgggggtac 2160 
gtgccagacc tagaggcccg ggtgaagagc 2220 
aacatgcccg tccagggcac cgccgccgac 2280 
cccaggctgg aggaaatggg ggccaggatg 2340 
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ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<210> 184 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 184 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Glu Thr Glu Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
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245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 
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Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 185 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 185 

ccggggaaag tcctcctccg tctcggcccg gcccgcctt 3 9 
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<210> 186 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 186 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 60 0 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 7 80 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 90 0 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 10 80 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 0 0 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 126 0 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 144 0 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 150 0 
cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 
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2517 



<210> 187 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 187 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 188 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 188 

cgggacctcg aggcgcgtga accccaggag gtccac 36 

<210> 189 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 189 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


12 0 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


c 99cgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 
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agcctcctcc acgagttcgg ccttctggaa 
cccccgccgg aaggggcctt cgtgggcttt 
gatcttctgg ccctggccgc cgccaggggc 
aaagccctca gggacctgaa ggaggcgcgg 
gccctgaggg aaggccttgg cctcccgccc 
ctggaccctt cgaacaccac ccccgagggg 
gaggaggcgg gggagcgggc cgccctttcc 
cttgaggggg aggagaggct cctttggctt 
gtcctggccc atatggaggc cacgggggtg 
tccctggagg tggccgagga gatcgcccgc 
caccccttca acctcaactc ccgggaccag 
cttcccaagt tgaagaagac gaagaagacc 
gaggccctcc gcgaggccca ccccatcgtg 
aagctgaaga gcacctacat tgaccccttg 
ctccacaccc gcttcaacca gacggccacg 
aacctccaga acatccccgt ccgcaccccg 
gccgaggagg ggtggctatt ggtggccctg 
gcccacctct ccggcgacga gaacctgatc 
acggagaccg ccagctggat gttcggcgtc 
cgggcggcca agaccatcaa cttcggggtc 
caggagctag ccatccctta cgaggaggcc 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccgccgctac 
gtgcgggagg cggccgagcg catggccttc 
ctcatgaagc tggctatggt gaagctcttc 
ctccttcagg tcgccaacga gctggtcctc 
gcccggctgg ccaaggaggt catggagggg 
gaggtgggga taggggagga ctggctctcc 

<210> 190 
<211> 839 
<212> PRT 

<213> Artificial Sequence 



agccccaagg 


ccctggagga 


ggccccctgg 


900 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


ggtaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gactatagcc 


agatagagct 


cagggtgctg 


I860 


cggguccccc 


aggaggggcg 


ggacatccac 


1920 


c c c cgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 190 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
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305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
62 5 630 635 640 



358 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 191 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 191 

ctcctccacg agttcggc 18 



<210> 192 
<211> 48 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 192 

accggtcttc ttcgtcttct tcaacttggg aagcctgagc tcgtcaaa 4 8 
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<210> 193 

<211> 33 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 193 

aagacgaaga agaccggtaa gcgctccacc age 33 



<210> 194 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 194 

gtcgactcta gatcagtggt ggtggtggtg gtgcttggcc gcccggcgca tc 52 



<210> 195 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modif ied_base 
<222> (19) . . (42) 

<223> The bases in these positions within this primer 

are 91% of the base shown and 3% each of the other 
3 nucleotides . 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 195 

ggagegctta ccggtctttt gegtcttett gatcttggga agecttaget cgtcaaagag 60 



<210> 196 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 196 

ctcctccacg agttcggc 18 



<210> 197 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 



Sequence 
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<221> modif ied_base 
<222> (19) . . (42) 

<223> The bases at these positions within this primer 

are 91% of the base shown and 3% each of the other 
3 nucleotides . 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 197 

caaaagaccg gtaagcgctc caccagcgcc gccgtcctgg aggccctccg cgaggcccac 60 

<210> 198 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 198 

gtcgactcta gatcagtggt ggtggtggtg gtgcttggcc gcccggcgca tc 52 

<210> 199 
<211> 2517 
<212> DMA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 199 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


C999ccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


C9 a 9gcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 
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— \ /**T/*i y^i 4— /*i /*i -1— /*i >— i 

ayC CUCCUCC 


acgagtt egg 


cc tt c tggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


O. A A 

you 


CCCCCgCCgg 


aaggggcett 


cgtgggcttt 


gtgctttccc 


gc aaggagc c 


catgtgggcc 


960 


gat cttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


egagecttat 


102 0 


aaagccctca 


gggacctgaa 


ggaggegegg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


c c c cgagggg 


gtggcccggc 


getaeggegg 


ggagtggacg 


12 00 


gaggaggcgg 


gggageggge 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


tacegggagg 


tggagaggee 


cctttccgct 


i o n 
1 6Z U 


gtcctggccc 


atatggaggc 


caegggggtg 


cgcctggacg 


tggectatet 


cagggecttg 


13 80 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


cacccctt ca 


acctcaactc 


ccgggaccag 


c tggaaaggg 


tcctctttga 


cgagctaagg 


T c a A 
IbU U 


cttcccaaga 


tcaacaagac 


gaagaagacc 


ggtaageget 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gaetatagee 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


192 0 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


etteggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


O A A A 


caggagctag 


ccatccctta 


egaggaggee 


caggccttca 


ttgagegcta 


ctttcagagc 


O T A A 


ttccccaagg 


tgcgggcctg 


gat t gagaag 


accctggagg 


a 999 ca 99 a 9 


gcgggggt ac 




9tggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggeccg 


ggtgaagagc 


222 0 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


O O O A 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gecaaggage 


accaccacca 


ccaccac 


2517 



<210> 200 

<211> 839 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 200 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
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305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys He Asn Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
62 5 630 635 640 
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Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 201 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 201 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

365 



accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


/inn 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagugg 


d4 U 


gccgactacc 


gggccc tgac 


eggggacgag 


uccgacaacc 


ttcc egggg n 


caagygcauc 


£ n n 

OUU 


g999 a 9 aa 9 a 


cggcgaggaa 


gcttctggag 


gageggggga 


gecuggaage 


/^i "h~ /— < r~i 4~~ /-i *— % -3 /— f 

ccuccuc aag 


c ^ n 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


uggcccacau 


ggacgauc eg 


/ z u 


aagc tct cct 


gggacctggc 


caaggtgege 


accgacctgc 


c c c t ggagg t 


ggac ttcgee 


"7 P fi 


aaaaggcggg 


agcccgaccg 


ggagaggc 1 t 


agggecttte 


t ggagaggc t 


tgagt ttggc 




agcctcctcc 


acgagtt egg 


cctt ctggaa 


agccccaagg 


c cc tggagga 


ggcccccugg 


q n n 
yuu 


cccccgccgg 


aaggy ycc 1. 1. 




y tgcuutccc 






jDu 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgt.ee 


accgggcccc 


cydyect udu 


1UZU 


aaagccctca 


gggacctgaa 


99 a 99 c 9 c 99 


gggcttctcg 


ccaaagacct 


gagegtt ctg 


t r\ O A 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


114 U 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


getaeggegg 


ggagtggacg 


12 0 0 


gaggaggcgg 


gggageggge 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


t o ei a 
Iz 6 U 


cttgaggggg 


a gg a g a ggct 


cctttggctt 


tacegggagg 


tggagaggee 


cctttccgct 


l^Sz U 


guccuggccc 


anauggaggc 


caegggggeg 


cgccuggacg 


/— f/T /-i /-i *H* «TI 4~ +— 

LygcctatCL 


CdyyycCLLy 




tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


") A A A 
144 U 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaagg 


1 C A A 

Id U U 


attcccaaga 


tcaagaagac 


gcataagacc 


ggtaageget 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gegagge c c a 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


IbzU 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


looU 


ct ccacaccc 


get tcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ct ccgat ccc 


1 / 4 U 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


i o r\ r\ 
loU U 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gaetatagee 


aga t agagc t 


cagggtgctg 


T O C C\ 

loo (J 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


192 0 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


etteggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


egaggaggee 


caggccttca 


ttgagegcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gegggggtae 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggeccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 
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ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 24 00 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 202 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 202 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 
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Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg lie Pro Lys lie Lys Lys Thr His Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 
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Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 203 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 203 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 
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ocu^t-ciL-c tyy 


ee t aeey eae 


etteeaeyee 


e tyaayyy ee 


teaeeaeeay 


cc 999999 a 9 


ion 


ccggtgcagg 


eggtctaegg 


ct tegecaag 


agect cct ca 


aggccctcaa 


99 a 99 ac 999 


1 0 u 




tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 




gggtacaagg 


C999 cc 999 c 


ccccacgccg 


gaggacttt c 


cccggcaact 


cgccctcatc 


0 n r» 
6 U U 


aaggagc t y y 


tggacctcct 


9999 ct 99 c 9 


cgect cgagg 


tcccgggcta 


egaggeggae 


•J f A 

^ fa 0 


gacg tec tyy 


c c age c tggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


/! O f\ 

4zU 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


999ta.ee tea. 


t caccccggc 


ctggct ttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


54 0 




LJy y CCC tgaC 


cgyygacyag 


t ccgacaacc 


ttcccggggt 


c aagggca t c 


bUU 


yyyy ayaaya 


c 99 c 9 a 99 aa 


get t c tygay 


gagtggggga 


gectggaage 


cctcctcaag 


d c n 
boU 


aaee tyy aee 


99 CT -9 aa 9 cc 


eyeca t cegg 


gagaaga t c c 


tggcccacat 


ggacgat ctg 


Ton 


aayc tc tccu 


gygacc tggc 


caaygtgege 


accgacctgc 


ccc tggagg t 


ggacttcgcc 


H O A 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggecttte 


t99 a 9 a 99 ct 


tgagtttggc 


840 


agcctcc tec 


aegagttegg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


y uu 


cccccgccgg 


aaggggcett 


cgtgggcttt 


gtgctttccc 


geaaggagee 


catgtgggcc 


960 


/^T ^ 4~ y^t 4"" 4— /— * -1— /T/— Y 

CjdtcttCtyy 


ccc tggc cgc 


cgc c aggggc 


ggccgcgtcc 


accgggcccc 


egagecttat 


102 0 


aaagccctca 


gggacctgaa 


ggaggegegg 


gggcttctcg 


ccaaagacct 


9 a 9cgttctg 


1080 


gecctgaggg 


aaggee ttg9 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


/—1 4— «i-f 1 /-1 /—1 4— 4— 

ctyyacccut 


cyaacaccac 


ccc cgagggg 


gtggcccggc 


getaeggegg 


ggagtggacg 


1^ U (J 


gaggaggegg 


999 a 9 c 999 c 


cyccctttcc 


gagaggctct 


tcgccaacc t 


gcttaagagg 


T O £Z n 
IZD U 


curgaggggg 


agy ay aggc t 


cct t tygc t t 


t a c c gggagg 


t99 a 9 a 99 cc 


cctttccgct 






atatyyayyc 


caegggggtg 


cgcctggacg 


tggectatet 


cagggecttg 


1 0 0 n 


t c c c tggagg 


fc99 c cgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


y^l 1 « /-^ f—\ 4— 4— ^ ^ 

CaCCCCUUCd 


acct caact c 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


_ 4-. 4- _ _ ?~\ --^ -~, *>f4- 

e t LLL-Lddyt 


tgaagaagac 


gaagaagacc 


ggtaageget 


ccagcagcgc 


cgccgtcctg 


1560 


yayyeeet.ee 


ycyaygccca 


c ccc at cgtg 


gagaagat c c 


tgcagtaccg 


ggagctcacc 


162 0 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gecgaggagg 


9gt99ctatt 


ggtggccctg 


gaetatagee 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 
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acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 198 0 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 04 0 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 204 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 204 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp 

Asn Leu 

Leu Glu 
210 

Leu Lys 
225 

Lys Leu 

Val Asp 

Phe Leu 

Leu Glu 
290 

Gly Ala 
305 

Asp Leu 
Pro Glu 
Leu Ala 



Pro Pro 
370 

Asn Thr 
385 

Glu Glu 



Leu Leu 
Glu Val 



Gin Trp Ala 
180 

Pro Gly Val 
195 

Glu Trp Gly 

Pro Ala lie 

Ser Trp Asp 
245 

Phe Ala Lys 
260 

Glu Arg Leu 
275 

Ser Pro Lys 

Phe Val Gly 

Leu Ala Leu 
325 

Pro Tyr Lys 
340 

Lys Asp Leu 
355 

Gly Asp Asp 
Thr Pro Glu 



Asp Tyr 
Lys Gly 



Ser Leu 
215 

Arg Glu 
230 



Arg Ala Leu Thr 
185 

lie Gly Glu Lys 
200 

Glu Ala Leu Leu 



Lys lie Leu Ala 
235 



Leu Ala Lys Val Arg Thr 
250 



Arg Arg 
Glu Phe 



Ala Leu 
295 

Phe Val 
310 



Glu Pro Asp Arg 
265 

Gly Ser Leu Leu 
280 

Glu Glu Ala Pro 



Leu Ser Arg Lys 
315 



Ala Ala Ala Arg Gly Gly 
330 

Ala Leu Arg Asp Leu Lys 
345 

Ser Val Leu Ala Leu Arg 
360 



Ala Gly Glu 
405 

Lys Arg Leu 
420 

Glu Arg Pro 
435 



Pro Met 
375 

Gly Val 
390 

Arg Ala 
Glu Gly 



Leu Leu Ala Tyr 



Ala Arg Arg Tyr 
395 

Ala Leu Ser Glu 
410 

Glu Glu Arg Leu 
425 



Gly Asp Glu 
190 

Thr Ala Arg 
205 

Lys Asn Leu 
220 

His Met Asp 

Asp Leu Pro 

Glu Arg Leu 
270 

His Glu Phe 
285 

Trp Pro Pro 
300 

Glu Pro Met 

Arg Val His 

Glu Ala Arg 
350 

Glu Gly Leu 
365 

Leu Leu Asp 
380 

Gly Gly Glu 
Arg Leu Phe 



Leu Ser Ala Val Leu Ala 
440 



Leu Trp Leu 
430 

His Met Glu 
445 



Ser Asp 

Lys Leu 

Asp Arg 

Asp Leu 
240 

Leu Glu 
255 

Arg Ala 

Gly Leu 

Pro Glu 

Trp Ala 
320 

Arg Ala 
335 

Gly Leu 

Gly Leu 

Pro Ser 

Trp Thr 
400 

Ala Asn 
415 

Tyr Arg 
Ala Thr 



Gly Val 
450 

Ala Glu 
465 

His Pro 



Asp Glu 



Arg Leu Asp 
Glu He Ala 



Phe Asn Leu 
485 

Leu Arg Leu 
500 



Val Ala 
455 

Arg Leu 
470 



Tyr Leu Arg Ala 



Glu Ala Glu Val 
475 



Asn Ser Arg Asp Gin Leu 
490 

Pro Lys Leu Lys Lys Thr 
505 



Leu Ser Leu 
460 

Phe Arg Leu 
Glu Arg Val 



Lys Lys Thr 
510 



Glu Val 



Ala Gly 
480 

Leu Phe 
495 

Gly Lys 
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Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 
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<210> 205 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 205 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 




gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccaccagcgc 


cgccctcctg 


1560 
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gaggccctcc 


gcgaggccca 


ccccdtcy Ly 


yciyctciytx l- l* l- 


t" nr* ^\ rri~ z^c* ca 


acr acr n 1~ c a c c 

y y v*« ^ <■■» ci x-* ^ 


1620 


aagctgaaga gcacctacat 


tgaccccttg 


ccggacctca 


iw k*- d. l< w ' ^— ' ^*y 


aaccrcrcrcccrc 
yciv^yy y v-v-^j^- 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


y y v-- L- d cty i_- o.y 


pt ccc/afcccc 

^ «w tA ^ Vw* 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 




crcrppt" 1~ pat" p 


1800 


qccqaqqaqq 


qqtqqctatt 


ggtggccctg gactatagcc 


cty ct i, cty cty l^ l. 


^^y yy ^y °y 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


■3 /^frra r~t f~r f~* 




192 0 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


fC'ffiTTfTSSPiT 1 
L^.(_,y Ly y o.'mL.l 


^L^u-yciL-y^-y^ 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


l. y L.k_.yyoL^L^ci. 


ppcrnpl - r*1" pp 


2040 


caggagct ag 


ccatccctta 


cgaggaggcc 


caggccttca 


i_ i_y cty *_»y l. d 


l_.L-L.L-L> cty cty *— • 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


a y yy c a yy a y 


gcggggguac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


yy ugaagagc 


99 9 0 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


l. l> t_. cty y y l^cil-- 


V— *y ^-*y l> L^y dv-» 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 206 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 206 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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Gly Val Arg Leu Asp 
450 

Ala Glu Glu He Ala 
465 

His Pro Phe Asn Leu 
485 

Asp Glu Leu Arg Leu 
500 

Arg Ser Thr Ser Ala 
515 

He Val Glu Lys He 
530 

Thr Tyr He Asp Pro 
545 

Leu His Thr Arg Phe 
565 

Ser Ser Asp Pro Asn 
580 

Gin Arg He Arg Arg 
595 

Ala Leu Asp Tyr Ser 
610 

Gly Asp Glu Asn Leu 
625 

Thr Glu Thr Ala Ser 
645 

Pro Leu Met Arg Arg 
660 

Gly Met Ser Ala His 
675 

Glu Ala Gin Ala Phe 
690 

Arg Ala Trp He Glu 
705 

Val Glu Thr Leu Phe 
725 

Arg Val Lys Ser Val 
740 

Pro Val Gin Gly Thr 
755 

Leu Phe Pro Arg Leu 
770 



Val Ala Tyr Leu Arg Ala 
455 

Arg Leu Glu Ala Glu Val 
470 475 

Asn Ser Arg Asp Gin Leu 
490 

Pro Lys Leu Lys Lys Thr 
505 

Ala Leu Leu Glu Ala Leu 
520 

Leu Gin Tyr Arg Glu Leu 
535 

Leu Pro Asp Leu He His 
550 555 

Asn Gin Thr Ala Thr Ala 
570 

Leu Gin Asn He Pro Val 
585 

Ala Phe He Ala Glu Glu 
600 

Gin He Glu Leu Arg Val 
615 

He Arg Val Phe Gin Glu 
630 635 

Trp Met Phe Gly Val Pro 
650 

Ala Ala Lys Thr He Asn 
665 

Arg Leu Ser Gin Glu Leu 
680 

He Glu Arg Tyr Phe Gin 
695 

Lys Thr Leu Glu Glu Gly 
710 715 

Gly Arg Arg Arg Tyr Val 
730 

Arg Glu Ala Ala Glu Arg 
745 

Ala Ala Asp Leu Met Lys 
760 

Glu Glu Met Gly Ala Arg 
775 



Leu Ser Leu Glu Val 
460 

Phe Arg Leu Ala Gly 
480 

Glu Arg Val Leu Phe 
495 

Lys Lys Thr Gly Lys 
510 

Arg Glu Ala His Pro 
525 

Thr Lys Leu Lys Ser 
540 

Pro Arg Thr Gly Arg 
560 

Thr Gly Arg Leu Ser 
575 

Arg Thr Pro Leu Gly 
590 

Gly Trp Leu Leu Val 
605 

Leu Ala His Leu Ser 
620 

Gly Arg Asp He His 
640 

Arg Glu Ala Val Asp 
655 

Phe Gly Val Leu Tyr 
670 

Ala He Pro Tyr Glu 
685 

Ser Phe Pro Lys Val 
700 

Arg Arg Arg Gly Tyr 
720 

Pro Asp Leu Glu Ala 
735 

Met Ala Phe Asn Met 
750 

Leu Ala Met Val Lys 
765 

Met Leu Leu Gin Val 
780 
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Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 



Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 207 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 207 

gtcggagggg tcccccacga g 



<210> 208 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 208 

tgtggaattg tgagcgg 



<210> 209 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modif ied_base 
<222> (28) . . (59) 

<223> The bases in these positions within this primer 

are 91% of the base shown and 3% each of the other 
3 nucleotides . 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 209 

ctcgtggggg acccctccga caacctcccc ggggtcaagg gcatcgggga gaagaccgcc 
ctcaagcttc tcaag 



<210> 210 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



378 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 210 

gtggcctcca tatgggccag gac 23 

<210> 211 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 211 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccg 


aggggtcagg 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


rtcaaaaacc 


t craac CQQcrt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


qqcccacctcr 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 
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caggccc ex l 


ccc egg age u 


■f - ft ft ft ft Ti (T/t-3 

cgeggaggag 


acccgccgcc 


Lcgaggdggd 


rift\~ ft 4- 4- *-i f-fi^t 

yy uv—L. LL^yt 


1 440 

J_*±*± \J 


c eg y eg gg c c 


accccc ccaa 


ccc caacccc 


egggaccay c 


^yy yyy 




lJUW 


gage 1 1 aggc 


t tcccgcctt 


9999 aa 9 ac 9 


caaaagacag 


ft ft ti ft ft fr ft 4— i^i 

gcaagcgccc 


ft ^ ft ft -Ti ft ft ft ft ft 

cac cagey cc 


IjDU 


*-*r s**r/~r of 01 "f - of Of 

geggegcugg 


aggcccuacg 


ft ft ft ft /~i /~i ■a <"i 

ggaggcccac 


cccaucgugg 


ayddydUCCC 


ftftzi ft ft a ft ft ft ft 

cca.gca.ccgy 


1DZU 


Of *\ s~i 4"~ Ol Ol f?\ 

gage c. caeca 


agctcaagaa 


ft f-\ « 4~~ O* Of 4"" Of 

cacccacgcg 


gaccccc ccc 


^ •a ft f* ft 4— y-</^4- 

CddyCCtCtjt 


ft ft -~\ ftfiftft-rz ft ft 

cca.ee cya.yy 


1DOU 


acgggccgcc 


t ccacacccg 


cr ccaaccag 


acggccacgg 


/— i /~i ft fx ft ft fr 

ecaeggggag 


ft ft +" +* ^ ft\~ T5 ft ft 

gc c c ay t aye 


1 "74 Pi 


tccgacccca 


acctgeagaa 


catccccgt c 


o* or o* ^ oi ot oi o* o< 4~- 

cgcacccccu 


4— or or or 01 01 ^ Of ^ Of 

c 999 cca 9 a 9 


/ — f 4- /~ 1 |» — 1 /-1 ^-i ft ft 

gacccgccgg 


1 pnn 

IOUU 


gect tcgtgg 


c egaggeggg 


4— 4— /^f /— *- 4— 4~* Or 

ecyggege eg 


ofh* or of o o< o* "h" of of 

gcggccccgg 


-3 /n 4- -1 4- -3 ft ft ft 

accacagcca 


/^f^ 4- d /-r d ft ft 4-" 1^1 

gacagagc cc 


lOOu 


ft ft ft ft \— /—i -1— ft ft 

cgcgcccccg 


/~< ft ^ ft f* 4-* ft\~ ft 

cccacccccc 


ft ft ft ft ft ^ ft ft^ a 

ey yyy dey dd 




nrrrrt" 4~ 4— ^ r-i -3 

yyy l-cccl 


y y dy y y y ct a.y 




gacaeccaca 


cccagaccgc 


aagc t ggdug 


4— ■ 4— f\ f~t s~r /-i i-* 4— /-i /—i 

cccggcg ccc 


ft ft ft ft ft ft ^ ft ft ft 

ccccggo.ggc 


"H /~rr~T a f* ft ft ft 

cy uyyo.ee tc 


x ^ 0 VJ 


/i 1^ — l 4— /*-»•■ Ol Or OJ Ol 

c cgaegcgcc 


j~T/~*rs~*r ocor oi oi oi T\ 

gggcggccaa 


gaeggegaac 


cccggcgccc 


4— /-1 4- ^ /-1 f* ft ft -rj 4— 

ucudcyycdt 


fft~ ft ft ft ft f n^t- 

gcccgcccdc 


Z U f± u 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggegg 


tggectttat 


agagege tac 


ziUU 


ttccaaagct 


tccccaaggt 


geggge ct gg 


atagaaaaga 


ccctggagga 


ggggaggaag 


zloU 


eggggctacg 


tggaaaccct 


etteggaaga 


aggegctacg 


tgcccgacct 


caacgcccgg 


0 0 o n 


gtgaagagcg 


t c agggaggc 


cgcggagcgc 


atggcct tea 


acatgcccgt 


ccagggcacc 


zzoU 


ft ft ft ft ft ftft^ /-^* /-i 

gccgccgacc 




cgccdt ggtg 


3 3 nr< 4— /^i 4— 4— fi 

O-cigCCCCCCC 


oocgcccccg 


ggagaugggg 




gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


eggctttgge 


caaggaggee 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 212 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 212 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 
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Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Arg Gly Val Arg Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 
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Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 
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Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala 
785 790 795 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala 
805 810 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 213 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 213 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agtataccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 
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Met Ala 

Leu Ala 

Met Leu 

Arg Ala 
800 

Tyr Pro 
815 

Trp Leu 



a t cr t Qcrcrc crcr 


acre tfcaaacrp 


pp f crcrc r* crc r* 


gaccccttgg 


pcrcrcrcrp 1~ a a a 


y y cli^ l. cicLy 


ac c cr t c t 1* crcr 


eft" pcra crcr era 


y y y y ^- >- ciy cic 


eject ace tec 


t~ crcra p p p r i~ p 


y o.ci^ cl^ 


cracrtcrcTapcrcr 
y ^-y "-yy^-^yy 






cttaaararr 

*w CitAM V* # ^-j \^ 


f- ccra. crcr crcr era 
i-^y ayy yy y ct 


yy ctya.cty ^ 


ct ctcccggg 


4- crcr rrr a 




caggccct t t 


c c p t crcra err* t~ 


y y y ay y ciy 


1 1 crcr p acrcr p c* 
u y y ^yyy ^ v - 


d V-r ^> l^i L- LoV_>ClCl 


ppl" ra a pi - pp 


a acr c 1 1 acrcr c 


"h t~ c c cere* c t~ i~ 


yyyy^^-y^-'-y 


cr c crcr tcrct era 
j^jj ^y ^ '-y y 


acrcr c c c t a p cr 


y y a y y^^vdL. 


gage t c a c c a 


acre teaacraa 


pa ppt" a pcrtrr 


acacraccacc 


tccaoaccpcr 


• i— ^ ciclw ^ciy 


tccgacccca 


acctacaaaa 

tAWy L»MOUMCiLC4 




acrft r*crt"crcr 


p 1 PTia rrrrrrrrrn 


u L yyy ^y ^ty 


ccrccrfccpt - per 


^ w v_» Cl\_» L.V_> 


^-y y yy ct^y ctci 


gacatccaca 


cccacrappcrp 


a a crp t~ crcra i - cr 
dcty u i_-yy duy 


ctgatgcgcc 


crcrcreicrcrppa a 
y y y ^y y ^ 


era ccfcrf - era a r 1 


acractctccc 


acrcracrpt - t~crp 




ttccaaagct 


tpcpna acrcr1~ 


cr p cr crcr r» c* t* cr cr 

y^yyy^^t-yy 


c crcrcrcr c t a c cr 


t~ crcra aarrft" 


i— L- ^--y y ddy d 


crt craaaacrccr 


t c acr crcr acrcr c 
*- ^ °-y y y ^-y y ^ 


pcrpcrcra crc crc 

u y ^ y y a y ^y k - 


gccgccgacc 


tcatcraaact 


c crp pa t~crcri"cr 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


gaggaggtgg 


eggctttgge 


caaggaggee 


ctggaggtgg 


aggtggggat 


gggggaggac 


caccac 







<210> 214 

<211> 842 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 



j- err* a crcrcrcrccr 


ptp< Prrrrf i" /^r/~< ^ 

yocyLy Lycd 


ccgggcagca 


1 d n n 

lUz U 


era crn\~ p* ^tt^tct 

y*3.yytccggg 




caaggacct c 


T Cl O Ct 

luoU 




gggacgaccc 


catgctcctc 


114U 


cccgaggggg 


ty' c J cc J c y'y' c 9' 


etaegggggg 


12 0 0 


p , i - P 1 P 1 1 — r^l - ^ <~it~r 


dgciyy c ccca 


t eggaaect c 


1 npri 

lz o U 


p 1" 1" t" erer p "h p 1" 


a ppa p'prapTPTT - 
aLLdLyayy u 


yyddddyece 


1 ion 


cil t^yyyy Ldu 




ggee tacct t 


lioU 


a I - CP 1 PTP t P , (TP 1 P^ 


tcgaggagga 


ggtct teege 


1 /i /i n 

±44 U 


ccrcrcra p 1 p 1 a ptp 1 
^-y y y d^\— cty ^ 


tggaaagggr 


get ct t tgac 


t cr n ci 


P 4 2 3 3 2 iT aria pf 

LddaaydLdy 


geaagege t c 


caccagcgcc 


1560 


prrafprff rrpr 

OLLdCLytyy 


agaagacccu 


ccagcaccgg 


J_6z U 


crae , e i p , e , e ,i l - p , p' 


LddtJCCCCyt 


ccacccgagg 


I PQ 

lb o U 


a /~i ptptp 1 p< a /~>rrn 

cn_yyuCdCyg 


ecaeggggag 


gcttagtagc 


T ""7 /I Pi 

1/4 0 


p*ptp^ a f ^ /~» /~i /^i ■)— 

oyt/dcccccc 


t999' cca 9 a 9' 


gatccgccgg 


1 OA A 

1800 


y tggccccgg 


act at age c a 


gatagagctc 


186 0 




gggtcttcca 


99 a 9999 aa 9 


192 0 


■f- -f- P'pTPTP'PT'h* P* pi 

u LLy y t^y L-t_u 


ccccyydyyc 


cgtggacccc 


T Q O f\ 


+- f- P'PTPTP'Pfh P 1 P» 

u LLyy ^-y 


4— +— — i y^i /~rs~*/-i -a 4— 

LOidcyycau 


gt ccgcccat 


o Pi a r\ 


pt a ptpt a ptpt /~< rfrr 

gaggaggegg 


uggee c uuau 


agagegctae 


zlUU 


a h ana a aapra 
d L-dy dctdcty d 


ccc tggagga 


9999 a 99 aa 9 




a ptptp 1 ptp* t* a /-»it 

ay y Ly ctdcy 


cgcccgaccu 


caacgcccgg 


O O O A 


a i - ptptp 1 p* +~ p 1 a 
duyy^-v^u L. ^ ct 


a pi a "I - rtn /~i /~i <~t ■f - 

dLduyuccyu 


ccagggcacc 




aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 
2526 



Synthetic 
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<400> 214 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Tyr Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
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325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 
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Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 215 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 215 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 48 0 
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ccccraacrcrcc 

*np* ^1 ^-j >--* w 




L-L-L-y y dy Lyy 


p ■}- i - ■(- p/p-p^a pra 
v_. l l Ly y y dy d 


a r~rf~ a p~>p*rrTr^ y* I - 

dy LdLyyCLt 


caggceggag 




c acr t crcrcr t crcr 


a p t" t~ p p cr p err* 
ca.\_- v_ ^-y *» -, y >■» 


LfV_ LL»y Lyyyy 


("rapnppf" p pp 
y CILL L. L. ULLy 


apa a^pfrpr 1 


c yyy9 T:caa g 


oUU 


yy L-d L^ayy y 


dyddydooyo 


L-v_ LL-ddy L* L.L. 


p 4— p a a pp'a 

LiLaayyayu 


/*f pf/^fof a a ^ 4~ 

y£f99 aa 9 ccu 


ggaaaacctc 


c~ <r r\ 

bbU 


\— • L- ci cy ctciu^ 


t~ nrra p" , p" , ptpypt , I _ 

L yy ciL'L'yyy i- 


dddy v_ l. cty dd 


ddLtjuccgyy 


dydagdLcaa 


ggcccacctg 




IT 3 a P 1 3 nynt" a 

y d cty cH_C_UL.ci 


/~-r fT "I - /— l 4~~ /"l 4— 4— 

yyctctcciu 


ggagct ctucc 


^-99-?tgcgca 


ccgacctccc 


cctggaggtg 


780 




a rrPTPTPTP' 1 ptpt pra 

^yyyy^yyyc* 


yLLLydLcyg 


r~r a /T rT /TP 1 p 4- 4- a 

ya-yyygcuua 


y y gc cttccu 


ggagaggctg 


Q / 


crscrt" 1~ pcrcrr*;^ 

y Q.y vvH-jw ci 


yv_,\_ l_. v_ v_. l_. \_v_ci 


l. y dy l. l L.y y l. 


p ■}- p p f- pp a pp 

LiLLiyy dy y 


o 1 f~* f* pptp /—i 
LLLLCyLLCC 


cceggaggag 


o n n 


crrTT 1 c c t* ci err* 

y w >— ' v^- ^ i*yy \-> 


LiUi^uy LLyy d 


dy yyy ll l l-l. 


pf4- ppp;p 4- 4— p p 


ILLLLlLLCy 


ccccgagccc 


Q £T n 


cn-y ^-yyy^-yy 


cty o l l ctct cty L. 


L-L-LyyL.L*yL.L- 


tgcaggggcg 


y C-Cy L^guy ca 


ccgggcagca 


lUz U 


rra prr pi - 1* nrf 
ycLL\-LOL uy y 


cyy y y o l ctctct 


yydLL LLrddy 


gayytceggg 


gccucc uege 


caaggacctc 


T n O Pi 


i—f /—i /-i p+- p"*"f- ptpt 
y coy i_ou Lyy 


ccicgaggga 


y gy y c uayac 


cucy ugcccg 


gggacgaccc 


catgctcctc 


T 1 /! Pi 

114 U 


C~f /*< p 4- ari/rl- f*/"' 
yLL LctUL LLL 


4— p:p; a r~< -I - 4- /—i 
Lyy clL-L-L. L IL 


yaaLdLLaLL 


cccgaggggg 


tggcgcggcg 


etaegggggg 


±z U U 


y dy Cy y dcy y 


ciygcicy cege 


ccaccgggcc 


c t cc tct egg 


agaggctcca 


tcggaacctc 


1 ')/* A 


L. L Lddy L.y OL. 


l l. y ciy y y y y ci 


yydydayc L C 


4— +— 4— p'p-p 4— »— 1 4— 

ccttyycuCL 


dccacgaggL. 


ggaaaagece 


i oon 
z U 


p ■}— /— i •(- ^-i /— i /-< ppp 1 

LLLLCCcyyg 


ccctyyccca 


uauggaggee 


accggggcac 


ggcgggacgt 


ggcctacctt 


t t on 

13 80 


y y L- L. L. Ill 


LLLLyy cty t_ l 


tgcyLjciyy ciy 


dLCcgccgcc 


tcgaggagga 


ggtc ttcege 


144U 


i~ 4- ptptp^ptptptP" 1 P 1 
l Lyyoyyyo l. 


cLL-L-L-OL-LL-cLct 


pp'l-paap'l" p p 
L.L. Lv_ddL- Itt 


p rtnrf a p p a p - p 

L-yygdCCdyc 


^- y g c* a. ggg e 


ycucu l ugac 


±D U U 


yciy o L i_ciyy L- 


4- +- /-i /-i i ^ — « -1 — -1 — 
L- Lv_L.C_LjCC_t. L 


ggy yciay acy 


p a a a a p"a p a /~ r 

CddddgdCdy 


gcaagcgcuc 


caccagcgcc 


lob U 


geggugcugg 


ayyccctaCy 


ggaggcccac 


cccat cgtgg 


agaagatcct 


ccagcaccgg 


lbz^ U 


y cty L. ILaLLu. 


cty L- L L-clciy del 


OaLL L dv_y Ly 


p 1 a i^i /~> /~> "H~ '~< 

yaCCCCCLCC 


CddgccucgL 


ccacccgagg 


±b o U 


dcy y y coy oo 


LLLaLdLLLy 


r^t 4- 4- a a /^i a of 

LllCddCCdy 


acggccacgg 


ecaeggggag 


gcttagtagc 


1 /4 U 


LLLy ClLLLLCl 


dL.v_ Ly Ldy dd 


P a i - p p pi prr - ! - p 
L-d ILOLLy LL 


p p 1 p a /-< /™» /~t 'H 
L.yLdLLLLLC 


tyy y CCdgay 


gacccgccgg 


loUU 


rrp P" 1 "h I - p^ptI - ptpt 

yLttuty Lyy 


ccgaggcggg 


l uy y y L*y l t y 


ytggcccugg 


acuauageca 


gat agagc t c 


1 QPA 

looU 


cgcgicc ucg 


y*^ y^i /~< ^ /-i +^ /-i 

CCCaCClCLC 


eggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


192 0 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


c c c eggagg c 


cgtggacccc 


1980 


c cgaugcgcc 


gggcggccaa 


gaeggtgaac 


ttcggcgtcc 


tetaeggcat 


gtccgcccat 


2 040 


«yyctctccc 


aggagcttgc 


catcccctac 


gaggaggegg 


tggectttat 


ac J a 9cgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


eggggctacg 


tggaaaccct 


etteggaaga 


aggegctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 
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gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 60 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 
caccac 2526 



<210> 216 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 216 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Arg Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 
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Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 . 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

390 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 217 

<211> 2517 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 217 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
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(~*r*\~ a p^p^ptp* 1 a p^ 
u auuy oat 


U.U UU.v_.dL.yu.U- 


u uy ddy y y uu 


UUdUUdUUdy 


cc 999999 a 9 


ion 


ocyy uyudyy 


cggucuacgg 


?—\ 4- 4~ /~ f t~\ /~i \ 

cutcyccddy 


agee uuu uca 


aggcccucaa 


g9 a 99 ac 999 


1 on 
± OU 


yauycyyuya 


ucguggucuu 


ugacy ccaag 


gccccctcct 


t ccgccacga 


ggcctacggg 


o yi A 


yyy ududdy y 


cgggccgggc 


uu-uuduy uuy 


ydyydctLLC 


CCCyyCdaCt 


cyccctCduc 




a y y^-y c y y 


4— pvfnra P" 1 p 1 ^ r~< r^ 1 4~ 


y y y y ^ c a c y 


uyucucydyy 


Lcccyyycta 


egaggegga c 


t ^ n 


y ciuy u^^-uyy 


\~* u ay u u Lyy ^ 


uddy day y uy 


rra a a a rrpra npf 
y actdayydyy 


rTP 1 f" a pprarrprt" 

y u uduy dy y u 


uuyuduuuuu 


ft z u 


dU y U y d u d 


ddydUU UUUd 


riria /— f /—i 4- /-i /—i 4- 4— 

CCdyCLCCLU 


tccyaccyca 


uccacg uccu. 


ccaccccgag 


*± O U 


yyy uato Ltd. 


LUdLLuuyyc 


cuyy ci uuyy 


gaaaaguacg 


gcctgaggcc 


cgaccagtgg 




a /*i "H ra /"* 

yucyactdcc 


yyy cccugac 


eggggacgag 


uccgacaacc 


tt cccggggt 


caagggcat c 


£T n n 
b U U 


gy yyayaaya 


eggegaggaa 


gc tt c tggag 


gagtggggga 


gectggaage 


cctcctcaag 


boU 


-3 r-1 /-I 4— f-T/~r -a *-i /"I 

ddLLtyyaLC 


ygc ugaagee 


cgccauccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


/ z U 


ddy U LLLLLL 


yyy-iuuuyy u 


uddyyuycyc 


a /-< /~i/**r n /^(*i4- 

dccyacctyc 


ccc uggagg c 


ggac tt cgee 


■7 q n 
/ 0 U 


adaayycyyy 


a fir* /~i n /"^r a r*rv 

dycccydccy 


gg^g^ggcu u 


agggee uu uc 


tggagaggct 


tgagtttggc 




ciy UUUUUUUU 


arrfa rri~ t~ frrrt 

duydy u uuyy 


/-i i^t 4— 4- /-i 4— rT/^ra a 

uuu uuuyydd 


dyecucaagg 


cccuggagga 


ggcccccugg 


q a n 

yuu 


cccccyccyy 


--71 «««« /-i /— 1 4~~ 4~ 

ddyyyycct t 


eg u gggc cut 


gtgett tccc 


geaagg^gee 


catgtgggcc 




ydLLLLL tyy 


#~1 >-i ^-t 4~~ rffffi rT 

cccuyyccyc 


eye c syyyyc 


ggccgcgtcc 


accgggcccc 


egagee ttat 


T A O A 

JLUz U 


aaag C C C t C 3. 


gggacctgaa 


ggaggegegg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gc cc t gaggg 


aaggee ttgg 


cct cccgccc 


ggcgacgacc 


c cat get cct 


cgcctacctc 


1140 


uuyyduuuuu 


LyddudLUdL 


ccccgagggg 


guggcccggc 


gc uaeggegg 


ggagtggacg 


T O A A 


pta rrrr a nnrr^rn 

yayyayycyy 


PTPTpra rrprrrfrfp 

ggga-geggge 


frff 1 ^ 4- 4— 4— /-i /-i 

uyuuuuuuuu 


gdgaygu u c u 


ucy ccaacc u 


gc u uaagagg 


1 OCA 


cuugaggggg 


o-ggagaggcu 


/~< 4— 4- 4— rfrt /— 1 4- 4- 

ccuu uggcu u 


uacegggagg 


tggagaggee 


cctttccgct 


T *3 O A 


gucc uggccc 


auauggaggc 


caegggggug 


cgcctggacg 


tggectatet 


cagggecttg 


1380 


t ccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


(" , a/~ , f" , /-'/-i'f — 4— a 


aCC LCaaC UC 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagc t c agg 


1500 


pt4-4— pipipi-a-a /-* 4- 
ULL ULUady U 


cyddyddydC 


gaagaagacc 


ggtaageget 


ccagcagcgc 


cgccgtcctg 


T C ^" A 

lbbl) 


y yy u u u 


yc_gdggcccd 


uuuu.du.cguy 


yayaagaucc 


ugcagcaccg 


ggagct cacc 


T £T O A 


a a ctf* 4~ rta a <Ta 

ddycuydaya. 


gcacc tdCdu 


tgacccct tg 


ccggacctca 


tccaccccag 


gacgggccgc 


T O Pi 

1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gaetatagee 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 
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acggagac eg 


ccagctggat 


gt teggegtc 


c c c egggagg 




<— 1 ft 4— /->r -3 +— ft ft fx ft 


j, y o u 


cgggcggcca 


agaccatcaa 


etteggggtc 


ctctacggca 


tgucggccca 


v— t ft ft ft ft "i - /^i -j— /"i 

CCgCC L-C LCC 


o n a n 


caggagctag 


ccatccctta 


egaggaggee 


caggccttca 


ttgagegcta 


CLuuca.ga.gc 




tt ccccaagg 


tgcgggcctg 


gau ug agaay 


acccuyyayy 


a yyy ^-ayy a.y 


yegggggtae 


j£ _L O \/ 


gtggagaccc 


tctt eggecg 


ccgccgct ac 


gtgccagacc 


uagagy cccg 


ggugaagay o 


ooo n 
zzzu 


gtgegggagg 


cggccgagcg 


>— 1 ^ 4— y~*/— T y-H ^ 4— 4— /"i 

cauggccuuc 


aacacgcccg 


uccagggcac 


/-i/~t/-*i ft ft ft r<rf a /-* 

eyecy ccy ac 


zzou 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gecaaggage 


accaccacca 


ccaccac 


2517 



<210> 218 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 218 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 
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Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 
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<210> 219 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 219 

cgggacctcg aggcgcgtga accccaggag gtccac 3 6 

<210> 220 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 220 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 




180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


cgagacggag 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


c 9 a 99 c 99 ac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


gg a gtggacg 


1200 
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gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gc 1 1 aagagg 


i or n 


cttgaggggg 


aggagaggc t 


cctttggctt 


taccgggagg 


tggagaggcc 


cct ttccgct 


1 oon 

1J z u 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


noon 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1 A A A 

1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


lob U 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagat cc 


tgcagtaccg 


ggagctcacc 


lbZU 


aay cuyaay a 


vT/-1 /~l /~1 4— *a /~1 -~1 4- 

yCaCCtaCat 


4— /-r -a *-1 f~t 4— 4— /T 

tydCCCCtty 


ccy y acctca 


L.ccaccccag 


gacgggccgc 


i (Ton 


cuccacaccc 


gc l. tcaacca 


gacggccacg 


gccacgggca 


ggct aaguag 


cuccgauccc 




aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


i q a a 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


c 9 a 99 a 99 cc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


a 999 ca 99 a 9 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


222 0 


gtgcgggagg 


cggc cgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


"1 O O A 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 221 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 221 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 
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Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Glu Thr Glu Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 
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Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

Ala Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 



Phe Gly Arg Arg 
725 

Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly He 



Arg Tyr Val Pro 
730 

Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Asp Leu Glu Ala 
735 

Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Glu His His His His His His 
835 



<210> 222 

<211> 39 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 222 

ccggggaaag tcctcctccg tctcggcccg gcccgcctt 3 9 



<210> 223 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 223 

gtcggactcg tcaccggtca gggc 24 



<210> 224 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modif ied_base 
<222> (28) . . (60) 

<223> The bases in these positions within this primer 

are 91% of the base shown and 3% each of the other 
3 nucleotides . 

<220> 



400 



<223> Description of Artificial Sequence: Synthetic 
<400> 224 

ctgaccggtg acgagtccga caaccttccc ggggtcaagg gcatcgggga gaggacggcg 60 
aggaagcttc tggag 75 

<210> 225 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 225 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


c gggg ac g a g 


tccgacaacc 


ttcccggggt 


caatggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


ggg a gcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 
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tccctggagg 


tggccgagga 




m L-uy ciy y v^^y 


acrat ptt cccr 


cctaaccQcrc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


Mty y ctcictyy y 




prrarrcl" CaCfO 

M M O-M M M 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggcaagcgcu 


m m ay m ciy Ly m 




1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


ydyddyaL c m 


uy tciy ucttv— y 


y y ciy \ t,^a^^ 


1620 


aagctgaaga 


f*Tf~* /I /■-* 4— ^ ^ "I - 

gCdCCLaCaL 


LyaLLLLL l^y 


nprtn3 ppi~ p^ 


L* V— « M* m m • 23 




1680 


ct ccacaccc 


/— f 4~* 4~~ « —s /~1 S~A ~—\ 

yCtCCaaCCa 


ydL-yyuOdLy 


y ^L.ciuyyy uct 




ctccgatccc 


1740 


aacctccaga 


acatccccgt 


CCyCaCCL-Cy 


/-i 4— 4- /-TfTrT (~t -\ fra 

m L- Lyyytdya 


yy d uuuy uuy 


ggcett cat c 


1800 


gccgaggagg 


ggtggctatt 




ydL McLuciy mm 


cty cl t_ cty G.y m l. 


tdyyy uy m Ly 


1860 


gcccacctct 


ccggcgacga 


yadCCtydLC, 


My y y lluill 




aa^Cri 1" rear 


1920 


acggagaccg 


ccagctggat 


yutcyycyuc 


ccccyyy ciyy 


r^rri - rrrr^ ccc 


r* p t* crat - crccrc 

«w l_ C^L ^1 L/ 


1980 


cgggcggcca 


agaccat caa 


/-i 4— +— /~i/t/-Ti'"ti'~t"H" /~i 

Ct Lcgyyy ll 


lll L-ctMy y mcl 




V—* ^-H ^ w 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


C-Cty y LL tLLd 


l uy cty vL. ex. 


pt* 1™ "hp^craop 

M I—- M L- M O-^j M 


2100 


ttccccaagg 


tgcgggcctg 


gat t gagaag 


acccuyyayy 


^ y y y o ^y y ^y 


crpcrcracrQt - ac 

y ^yyyyy 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


ytyccdydLL 


L. ciy cty y Louy 


crcrl - era 3 ctp rip 
yy i-ycLciycLy m 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


ddLaLyLtty 


L- o u cLy yyuau 


My M M ^-J M M ^ UM 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg tcgccaacga gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 226 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 226 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 
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Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Asn Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 
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Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 
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Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 227 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 227 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cgcagaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 
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aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 
gaggaggcgg gggagcgggc cgcccttt cc gagaggctct tcgccaacct gcttaagagg 12 60 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 8 0 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 
cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccagcagcgc cgccgtcctg 156 0 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 204 0 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 210 0 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 8 0 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 228 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 228 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 
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Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Gin Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 
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Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arq 
54 5 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 
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Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 229 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 229 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatg 600 
ggggagaaga cggggaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
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<210> 230 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 230 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly Met Gly Glu Lys Thr Gly Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 
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Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 
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Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 231 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 231 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
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gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 240 0 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 232 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 232 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Asn Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
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210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 
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Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arq Arq Gly Tvr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 



Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 233 

<211> 2517 

<212> DNA 

<213> Artificial 



Sequence 



417 



<220> 

<223> Description of Artificial Sequence: Synthetic 



^ A Hrts 0 ^ "5 

< £ ±\J[J> ZJO 

atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaagc 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 
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aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 24 0 0 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 234 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 234 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 
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Asp Leu Tyr Gin 
145 

Gly Tyr Leu lie 



Pro Asp Gin Trp 
180 

Asn Leu Pro Gly 
195 

Leu Glu Glu Trp 
210 

Leu Lys Pro Ala 
225 

Lys Leu Ser Trp 



Val Asp Phe Ala 
260 

Phe Leu Glu Arg 
275 

Leu Glu Ser Pro 
290 

Gly Ala Phe Val 
305 

Asp Leu Leu Ala 



Pro Glu Pro Tyr 
340 

Leu Ala Lys Asp 
355 

Pro Pro Gly Asp 
370 

Asn Thr Thr Pro 
385 

Glu Glu Ala Gly 



Leu Leu Lys Arg 
420 

Glu Val Glu Arg 
435 

Gly Val Arg Leu 
450 

Ala Glu Glu lie 
465 



Leu Leu Ser 
150 

Thr Pro Ala 
165 

Ala Asp Tyr 

Val Lys Gly 

Gly Ser Leu 
215 

lie Arg Glu 
230 

Asp Leu Ala 
245 

Lys Arg Arg 

Leu Glu Phe 

Lys Ala Leu 
295 

Gly Phe Val 
310 

Leu Ala Ala 
325 

Lys Ala Leu 

Leu Ser Val 

Asp Pro Met 
375 

Glu Gly Val 
390 

Glu Arg Ala 
405 

Leu Glu Gly 
Pro Leu Ser 



Asp Val Ala 
455 

Ala Arg Leu 
470 



Asp Arg 
Trp Leu 



Arg Ala 
185 

He Gly 
200 

Glu Ala 



Lys lie 

Lys Val 

Glu Pro 
265 

Gly Ser 
280 

Glu Glu 
Leu Ser 



He His Val Leu 
155 

Trp Glu Lys Tyr 
170 

Leu Thr Gly Asp 



Glu Lys Pro Ala 
205 

Leu Leu Lys Asn 
220 

Leu Ala His Met 
235 

Arg Thr Asp Leu 
250 

Asp Arg Glu Arg 



Leu Leu His Glu 
285 

Ala Pro Trp Pro 
300 

Arg Lys Glu Pro 
315 



Ala Arg Gly Gly Arg Val 
330 



Arg Asp 
345 

Leu Ala 
360 



Leu Lys Glu Ala 



Leu Arg Glu Gly 
365 



Leu Leu Ala Tyr Leu Leu 
380 

Ala Arg Arg Tyr Gly Gly 
395 

Ala Leu Ser Glu Arg Leu 
410 

Glu Glu Arg Leu Leu Trp 
425 

Ala Val Leu Ala His Met 
440 445 

Tyr Leu Arg Ala Leu Ser 
460 

Glu Ala Glu Val Phe Arg 
475 



His Pro Glu 
160 

Gly Leu Arg 
175 

Glu Ser Asp 
190 

Arg Lys Leu 

Leu Asp Arg 

Asp Asp Leu 
240 

Pro Leu Glu 
255 

Leu Arg Ala 
270 

Phe Gly Leu 

Pro Pro Glu 

Met Trp Ala 
320 

His Arg Ala 
335 

Arg Gly Leu 
350 

Leu Gly Leu 

Asp Pro Ser 

Glu Trp Thr 
400 

Phe Ala Asn 
415 

Leu Tyr Arg 
430 

Glu Ala Thr 
Leu Glu Val 



Leu Ala Gly 
480 



420 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 



Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 



421 



Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 

<210> 235 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 235 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


cc 999999 a 9 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


C9999 a C9 a 9 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


99 a 9tggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 



422 



tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtct t ccg 


cctggccggc 


1 A A Pi 
144U 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


t cctct ttga 


cgagct cagg 


i c; n n 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


n n a n 
loo U 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 




aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


iron 
±o o (J 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


c tccgatccc 


T H A Pi 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


-LcSUU 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


looU 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


a 99 a 9999 c 9 


ggacatccac 


1 qo n 

iyz u 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


iy ou 


cgggcggcca 


agaccatcaa 


c t t cgggg t c 


ct ct acggca 


tgt cggccca 


cgccct ctcc 


OA/in 


c aggagc t ag 


ccatccctta 


cgaggaggc c 


caggccttca 


ttgagcgcta 


ct tt cagagc 


£ i. u u 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


zlbU 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


o *i o n 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


ta gggg a 99 a 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 236 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 236 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 



423 



Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 



Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 



424 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Ala Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 



425 



Pro Val Gin Gly Thr Ala Ala Asp 
755 760 

Leu Phe Pro Arg Leu Glu Glu Met 
770 775 

Ala Asn Glu Leu Val Leu Glu Ala 
785 790 

Ala Arg Leu Ala Lys Glu Val Met 
805 

Pro Leu Glu Val Glu Val Gly He 
820 

Glu His His His His His His 
835 



<210> 237 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 237 

tagctcctgg gagagggcgt gggccgacat gcc 3 3 



<210> 238 
<211> 2517 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 238 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 



Leu Met 

Gly Ala 

Pro Lys 

Glu Gly 
810 

Gly Glu 
825 



Lys Leu 

Arg Met 
780 

Glu Arg 
795 

Val Tyr 
Asp Trp 



Ala Met 
765 

Leu Leu 

Ala Glu 

Pro Leu 

Leu Ser 
830 



Val Lys 

Gin Val 

Ala Val 
800 

Ala Val 
815 

Ala Lys 
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aacctggacc ggctgaagcc cgccatccgg 
aagctctcct gggacctggc caaggtgcgc 
aaaaggcggg agcccgaccg ggagaggctt 
agcctcctcc acgagttcgg ccttctggaa 
cccccgccgg aaggggcctt cgtgggcttt 
gatcttctgg ccctggccgc cgccaggggc 
gcggggctaa aggacctcaa ggaggtccgg 
gcctcgaggg aggggctaga cctcgtgccc 
ctggaccctt cgaacaccac ccccgagggg 
gaggacgccg cccaccgggc cctcctctcg 
ctcgaggggg aggagaagct cctttggctc 
gtcctggccc atatggaggc caccggggta 
tccctggagc ttgcggagga gatccgccgc 
caccccttca acctcaactc ccgggaccag 
cttcccgcct tgaagaagac gaagaagaca 
gaggccctac gggaggccca ccccatcgtg 
aagctcaaga acacctacgt ggaccccctc 
ctccacaccc gcttcaacca gacggccacg 
aacctgcaga acatccccgt ccgcaccccc 
gccgaggcgg gttgggcgtt ggtggccctg 
gcccacctct ccggggacga aaacctgatc 
acccagaccg caagctggat gttcggcgtc 
cgggcggcca agacggtgaa cttcggcgtc 
caggagcttg ccatccccta cgaggaggcg 
ttccccaagg tgcgggcctg gatagaaaag 
gtggaaaccc tcttcggaag aaggcgctac 
gtcagggagg ccgcggagcg catggccttc 
ctcatgaagc tcgccatggt gaagctcttc 
ctcctccagg tcgccaacga gctcctcctg 
gcggctttgg ccaaggaggc catggagaag 
gaggtgggga tgggggagga ctggctttcc 



gagaagatcc 


tggcccacat 


ggacgauccg 


Ton 


accgacctgc 


ccc uggaggt. 


ggacu tcycc 


/ o u 


agggccttt c 


^ggagaggcu 


tgay uuugyc 


RAD 


agccccaagg 






jp \J U 


gtgctttccc 


gcaaggagcc 


caugugggcc 


y bu 


ggccgcgtgc 


ac cgggcagc 


agaccccttg 


t a o a 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


ggggacgacc 


ccatgctcct 


cgcctacctc 


n 1 / A 

114U 


gfcggcgcggc 


gctacggggg 


ggagtggacg 


1ZUU 


gagaggct cc 


at cggaacct 


f"* /~i "f - 4"~ /~"-f /~1 

CCttddyCyC 


JL<£ b U 


taccacgagg 


tggaaaagcc 


cctctcccgg 


i *3 o n 


cggcgggacg 


tggcctacct 


tcaggccctt 


T3QA 


ctcgaggagg 


aggtcttccg 


c 1 1 ggcgggc 


i /i /i n 


ctggaaaggg 


4— j-t 4— — , 4— J— 4— — 

tgctcuttga 


cgagcttagg 


i c a a 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


-Lob U 


gagaagatcc 


tccagcaccg 


ggagct cacc 


t c o a 


ccaagcctcg 


tccacccgag 


gacgggccgc 


i c q a 


gccacgggga 


ggcttagtag 


ctccgacccc 


i n a a 
1 /4U 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1 O A A 

1800 


gactatagcc 


agatagagct 


ccgcgtcctc 


I860 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1 Q Q A 

iy OU 


ct c tacggca 


tgtccgccca 


u aggc ucccc 


O A A A 
ZU4U 


gtggccttta 


tagagcgcta 


c ttccaaagc 


O T A A 


accctggagg 


a 9999 a 99 aa 


9 c 9999 ctac 


O 1 C A 

z lb U 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 




aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gccaagggtc 


accaccacca 


ccaccac 


2517 
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<210> 239 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 239 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 
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Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 
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Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 



<210> 240 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 240 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 12 0 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 24 0 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 
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ctcatcaagg agctggtgga cctcctgggg 
gcggacgacg ttctcgccac cctggccaag 
atcctcaccg ccgaccgcga cctctaccaa 
cccgagggcc acctcatcac cccggagtgg 
cagtgggtgg acttccgcgc cctcgtgggg 
ggcatcgggg agaagaccgc cctcaagctc 
ctcaagaacc tggaccgggt aaagccagaa 
gaagacctca ggctctcctt ggagctctcc 
gacctcgccc aggggcggga gcccgaccgg 
gagttcggca gcctcctcca cgagttcggc 
gccccctggc ccccgccgga aggggccttc 
atgtgggcgg agcttaaagc cctggccgcc 
gagccttata aagccctcag ggacctgaag 
agcgttctgg ccctgaggga aggccttggc 
gcctacctcc tggacccttc gaacaccacc 
gagtggacgg aggaggcggg ggagcgggcc 
cttaagaggc ttgaggggga ggagaggctc 
ctttccgctg tcctggccca tatggaggcc 
agggccttgt ccctggaggt ggccgaggag 
ctggccggcc accccttcaa cctcaactcc 
gagctagggc ttcccgccat caagaagacg 
gccgtcctgg aggccctccg cgaggcccac 
gagctcacca agctgaagag cacctacatt 
acgggccgcc tccacacccg cttcaaccag 
tccgatccca acctccagaa catccccgtc 
gccttcatcg ccgaggaggg gtggctattg 
agggtgctgg cccacctctc cggcgacgag 
gacatccaca cggagaccgc cagctggatg 
ctgatgcgcc gggcggccaa gaccatcaac 
cgcctctccc aggagctagc catcccttac 
tttcagagct tccccaaggt gcgggcctgg 



tttacccgcc tcgaggtccc cggctacgag 360 
aaggcggaaa aggaggggta cgaggtgcgc 42 0 
ctcgtctccg accgcgtcgc cgtcctccac 480 
ctttgggaga agtacggcct caggccggag 540 
gacccctccg acaacctccc cggggtcaag 600 
ctcaaggagt ggggaagcct ggaaaacctc 660 
aacgtccggg agaagatcaa ggcccacctg 72 0 
cgggtgcgca ccgacctccc cctggaggtg 780 
gaggggctta gggccttcct ggagaggctg 840 
ctcctggagg cccccgcccc cctggaggag 900 
gtgggcttcg tcctctcccg ccccgagccc 960 
tgcaggggcg gccgcgtcca ccgggccccc 102 0 
gaggcgcggg ggcttctcgc caaagacctg 10 80 
ctcccgcccg gcgacgaccc catgctcctc 1140 
cccgaggggg tggcccggcg ctacggcggg 120 0 
gccctttccg agaggctctt cgccaacctg 1260 
ctttggcttt accgggaggt ggagaggccc 132 0 
acgggggtgc gcctggacgt ggcctatctc 13 80 
atcgcccgcc tcgaggccga ggtcttccgc 1440 
cgggaccagc tggaaagggt cctctttgac 150 0 
caaaagaccg gcaagcgctc caccagcgcc 156 0 
cccatcgtgg agaagatcct gcagtaccgg 162 0 
gaccccttgc cggacctcat ccaccccagg 1680 
acggccacgg ccacgggcag gctaagtagc 1740 
cgcaccccgc ttgggcagag gatccgccgg 1800 
gtggccctgg actatagcca gatagagctc 186 0 
aacctgatcc gggtcttcca ggaggggcgg 192 0 
ttcggcgtcc cccgggaggc cgtggacccc 19 80 
ttcggggtcc tctacggcat gtcggcccac 2040 
gaggaggccc aggccttcat tgagcgctac 210 0 
attgagaaga ccctggagga gggcaggagg 2160 
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cgggggtacg tggagaccct cttcggccgc cgccgctacg tgccagacct agaggcccgg 222 0 
gtgaagagcg tgcgggaggc ggccgagcgc atggccttca acatgcccgt ccagggcacc 2280 
gccgccgacc tcatgaagct ggctatggtg aagctcttcc ccaggctgga ggaaatgggg 2340 
gccaggatgc tccttcaggt cgccaacgag ctggtcctcg aggccccaaa agagagggcg 2400 
gaggccgtgg cccggctggc caaggaggtc atggaggggg tgtatcccct ggccgtgccc 2460 
ctggaggtgg aggtggggat aggggaggac tggctctccg ccaaggagca ccaccaccac 252 0 
caccac 2526 



<210> 241 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 241 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 
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Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 
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Leu Lys Ser Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp 
595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp lie His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 



<210> 242 

<211> 2508 

<212> DNA 

<213> Artificial Sequence 



434 



<220> 

<223> Description of Artificial Sequence: Synthetic 



< i ± U U > 4* ft-s 

atggaattca 


ccccactttt 


tgacctggag 


gaacccccca 


agcgggtgct 


tctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttctat 


gccctgagcc 


tcaccacctc 


ccggggggag 


120 


ccggtgcaga 


tggtctacgg 


cttcgcccgg 


agcctcctca 


aggccttgaa 


ggaggacgga 


180 


caggcggtgg 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacgag 


240 


gcctacaagg 


cgggccgggc 


ccccaccccg 


gaggacttcc 


cccgccagct 


cgccttggtc 


300 


aagcggctgg 


tggaccttct 


gggcctggtc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


360 


gacgtcctgg 


gcaccctggc 


caagaaggcc 


gaaagggagg 


ggatggaggt 


gcgcatcctc 


420 


acgggagacc 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


480 


gggaccctgg 


tcaccccaaa 


ggacgtccag 


gagaagtacg 


gggtgccccc 


ggagcgctgg 


540 


gtggacttcc 


gcgccctcac 


gggggaccgc 


tcggacaaca 


tccccggggt 


ggcggggata 


600 


ggggagaaga 


ccgcccttcg 


actcctcgca 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


660 


aacctggacc 


gggtaaagcc 


ggactcgctc 


cggcgcaaga 


tagaggcgca 


cctcgaggac 


720 


ctccacctct 


ccttagacct 


ggcccgcatc 


cgcaccgacc 


tccccctgga 


ggtggacttt 


780 


aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt 


ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcggagg 


gccgggtcca 


ccgggcaaca 


1020 


agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc 


caaggacctg 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca 


cggacgaccc 


cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg 


ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt 


ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg 


gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct 


tctggaggcc 


1380 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt 


ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt 


tgacgagctg 


1500 


ggcctcaccc 


cggtgggccg 


gacgcagaag 


acgggcaagc 


gctccaccgc 


ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta 


ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc 


gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc 


cagctccgac 


1740 
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v_ v_- ctcn— ^ uy ^ 


dy cici O d t. C C C 


cguycyCa.cc 


ccct tggggc 


agcgcauccg 


caaggccttc 


loon 


gtggccgagg 


a ggggtggct 


ccule uggcg 


geggactact 


cccagattga 


gctccgggtc 


1860 


CLCjyCCCaCC 


ecueggggga 


cgagaacctg 


aagcggg t c t 


teegggaggg 


gaaggacatc 


1920 




ccgccgcc ug 


gatgtt egge 


ttagaccccg 


ctctggtgga 


tccaaagatg 


1980 


cgccgggcgg 


ecaagaeggt 


caac tt egge 


gtcctctacg 


ggatgtccgc 


ccacaggctc 


2 040 




ccyy Co. L,a.gci 


jT— 1 4— -~) -—\ -—^ T/^T ■— \ >f 

CtdCaagydy 


geggaggect 


ttattgagcg 


ctacttccag 


2100 


age LCCCCCa 


a gg^g c ggg c 


c tggatagaa 


aggaccctgg 


a gg a ggg cc g 


gaegegggge 


2160 




/-i < — < / — * i— /-T-l- i - r^f~rrr 
LLLty L-i_^yy 


ca gy a -yy c y c 


tatycycccy 


--\ /~i 4" rt/*f r( ^ /— t 

acccyycctc 


ccgggt cege 


O O O Pi 

A A A U 


teggugeggg 


o.ggcggcgga 


ycygaxggcc 


u ucaacaugc 


ccgtgcaggg 


caccgccgcc 


AAtiU 


gacctgatga 


agatcgecat 


ggtcaagctc 


ttccccaggc 


taaagcccct 


gggggcccac 


2340 


ctcctcctcc 


aagtgcacaa 


cgagctggtc 


ctggaggtgc 


ccgaggaccg 


ggecgaggag 


2400 


gccaaggccc 


tggtcaagga 


ggtcatggag 


aacgcctacc 


ccctggacgt 


gcccctcgag 


2460 


gtggaggtgg 


gcgtgggtcg 


ggactggctg 


gaggegaage 


aggattga 




2508 



<210> 243 
<211> 835 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 243 

Met Glu Phe Thr Pro Leu Phe Asp 
1 5 

Leu Leu Val Asp Gly His His Leu 
20 

Ser Leu Thr Thr Ser Arg Gly Glu 
35 40 

Ala Arg Ser Leu Leu Lys Ala Leu 
50 55 

Val Val Phe Asp Ala Lys Ala Pro 
65 70 

Ala Tyr Lys Ala Gly Arg Ala Pro 
85 

Leu Ala Leu Val Lys Arg Leu Val 
100 

Glu Ala Pro Gly Tyr Glu Ala Asp 
115 120 

Lys Ala Glu Arg Glu Gly Met Glu 
130 135 



Sequence : 



Leu Glu Glu Pro 
10 

Ala Tyr Arg Thr 
25 

Pro Val Gin Met 



Lys Glu Asp Gly 
60 

Ser Phe Arg His 
75 

Thr Pro Glu Asp 
90 

Asp Leu Leu Gly 
105 

Asp Val Leu Gly 



Val Arg lie Leu 
140 



ic 



Pro Lys Arg Val 
15 

Phe Tyr Ala Leu 
30 

Val Tyr Gly Phe 
45 

Gin Ala Val Val 



Glu Ala Tyr Glu 
80 

Phe Pro Arg Gin 
95 

Leu Val Arg Leu 
110 

Thr Leu Ala Lys 
125 

Thr Gly Asp Arg 
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Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Glu Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 
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Gly His Pro Phe 
Phe Asp 
Lys Arg 



Glu Leu 
500 



Ser Thr 
515 



Asn Leu 
485 

Gly Leu 
Ala Gin 



Asn Ser 
Thr Pro 



Arg Asp 
490 

Val Gly 
505 



Gly Ala 
520 



Pro lie 
530 



Val Glu Leu lie 



Ser Thr 
545 



Tyr Leu 
Arg Leu His Thr 



Asp Pro 
550 

Arg Phe 
565 



Leu Glu 
Tyr Arg 
Leu Pro Arg Leu 



Leu Gin 
535 



Asn Gin 



Ser Ser 

Gly Gin 

Leu Ala 
610 

Ser Gly 
625 

His Thr 

Asp Pro 

Tyr Gly 

Lys Glu 
690 

Val Arg 
705 

Tyr Val 
Ser Arg 
Met Pro 



Ser Asp 
580 

Arg lie 
595 

Ala Asp 
Asp Glu 
Glu Thr 



Lys Met 
660 

Met Ser 
675 



Pro Asn Leu Gin 



Arg Lys 

Tyr Ser 

Asn Leu 
630 

Ala Ala 
645 

Arg Arg 
Ala His 



Ala Glu Ala Phe 
Ala Trp 
Glu Thr 



He Glu 
710 



Lys Leu 
770 

Val His 
785 



Val Arg 
740 

Val Gin 
755 

Phe Pro 
Asn Glu 



Ala Lys Ala Leu 



Leu Phe 
725 

Ser Val 
Gly Thr 
Arg Leu 



Leu Val 
790 

Val Lys 
805 



Ala Phe 
600 

Gin He 
615 

Lys Arg 

Trp Met 

Ala Ala 

Arg Leu 
680 

He Glu 
695 

Arg Thr 
Gly Arg 
Arg Glu 



Thr Ala 
570 

Asn He 
585 

Val Ala 
Glu Leu 
Val Phe 



Phe Gly 
650 

Lys Thr 
665 



Gin Leu 

Arg Thr 

Ala Leu 

Glu Leu 
540 

Val His 
555 

Thr Ala 

Pro Val 

Glu Glu 

Arg Val 
620 

Arg Glu 
635 

Leu Asp 
Val Asn 



Ser Gin Glu Leu 



Arg Tyr 
Leu Glu 



Arg Arg 
730 

Ala Ala 
745 



Ala Ala 
760 



Asp Leu 
Leu Gly 
Leu Glu Val Pro 



Lys Pro 
775 



Glu Val 



Met Glu 
810 



Phe Gin 
700 

Glu Gly 
715 

Tyr Val 
Glu Arg 
Met Lys 



Ala His 
780 

Glu Asp 
795 

Asn Ala 



Glu Arg Val Leu 
495 

Gin Lys Thr Gly 
510 

Arg Gly Ala His 
525 

Ser Lys Leu Lys 



Pro Arg Thr Gly 
560 

Thr Gly Arg Leu 
575 

Arg Thr Pro Leu 
590 

Gly Trp Leu Leu 
605 

Leu Ala His Leu 



Gly Lys Asp He 
640 

Pro Ala Leu Val 
655 

Phe Gly Val Leu 
670 

Gly He Asp Tyr 
685 

Ser Phe Pro Lys 



Arg Thr Arg Gly 
720 

Pro Asp Leu Ala 
735 

Met Ala Phe Asn 
750 

He Ala Met Val 
765 

Leu Leu Leu Gin 



Arg Ala Glu Glu 
800 

Tyr Pro Leu Asp 
815 
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Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 



Lys Gin Asp 
835 



<210> 244 

<211> 34 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 244 

cttccagaac ctctttaaac ggctttccga gaag 34 



<210> 245 

<211> 34 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 245 

cttctcggaa agccgtttaa agaggttctg gaag 34 



<210> 246 

<211> 31 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 246 

ccggtgggcc ggacgcagaa gacgggcaag c 31 



<210> 247 

<211> 31 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 247 

gcttgcccgt cttctgcgtc cggcccaccg g 31 



<210> 248 

<211> 31 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
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<400> 248 

ctcctccaag tgcacaacga gctggtcctg g 



31 



<210> 249 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 249 

ccaggaccag ctcgttgtgc acttggagga g 31 

<210> 250 

<211> 2499 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 250 

4— s~*r f-* *~\ 4— 4— /-i /— i 

dUyyadtLCC 


tgcccct ct t 


tgagcccaag 


ggccgggtgc 


ttctggtgga 


cggccaccac 


60 


c tggcctacc 


gtacctttut 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


ggagccggtc 


120 


caggcggtgt. 


acgggcu ugc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 


gtgatcgtgg 


tgtttgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


atggtggacc 


ttttgggcct 


ggagcgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat 


cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 


ctgatcaccc 


cggagtggct 


ttgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


600 


aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctgggaggga 


agggtttacc 


gggcggagga 


tcccttggag 


1020 


gccttgcggg 


ggcttgggga 


ggtgaggggg 


cttttggcca 


aggacctggc 


ggtgctggcc 


1080 


ctgagggaag 


ggattgccct 


ggcaccgggc 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 
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r~x -a 4- /-» /-i 4- 4- /-i — i 
ydLLLL LLLct 




cyday yggua 


gc.ccggcgc t 


ac 9999999 a 


gtggaccgag 


1 1 Aft 

Iz U (J 


yd-yy^yyyyy 


a a apffrprpprpi - 
dddyyyLyL u 


rTr^t" t* I - ppn'a a 
yLLLLLLydcl. 


dyy l l CLdcy 


/^i /~t ^1 r™i 4— /*i 4— 

ccyccctccE 


gaagegge 1 1 


1 OCA 


ciciy y y y y o.y y 


cty cLy yL-LL^L. 


4~ 4- /-t/-t<^i f ffap 
LLyyLUL LdC 


gaggaggegg 


aaaagccccu 


ttcgcgggtc 


1 T1A 

L5£ U 


<"i +- cinr* r* r* a a 
l LyyLuUaLa 


LyyayytLdC 


yyyyytacgg 


uuggaugegg 


cctactt aaa 


ggccctttcc 


*i 0 0 n 
X J oU 


cxggaggtgg 


a 9 9 ^ y 9 a 9 a 


aaggcgc tec 


9 a gg a 99 a gg 


tccaccgcct 


ggcegggcat 


1440 


/-i 4—4—4— /I —\ -a 


tgaac ucccg 


ggaccagctg 


gaaagggtca 


tctttgacga 


gcttgggctt 


1500 


cccgccatcg 


gcaagacgca 


gaagaeggge 


aagcgctcca 


ccagcgccgc 


cgttttggag 


1560 


gccc tgcggg 


ayyctCatcc 


cat cgtggac 


cgcatccttc 


agtaccggga 


gctttccaag 


1620 


l u vday y y ctct 


LLLo.LdL.Lyd 


4— /~t /~< 4- 4— f~ff* r* 4— 
LuLLUtyCCU 


f—t /~i /"I 4— /^r /— r 4— >-i 

yccctyytcc 


accccaagac 


gaaccgcctc 


1680 


pa pa rr pprt~ t~ 
dv_> \_> o y i_ 


fpaa ppana p 

L- LGLCLLLCLy CLL 


nrT(^ pap r^rxc* 

yyLLctLL.yLL 


dLyyyy dyy l 


l. udy cagcuc 


ggatcc taat 


1 /4(J 


L LM U CICL CtCl LGi 


L-LLLLy L.y Ly 


/—l 3 ft S~1 /"I /~t 4—4—4— f~r 

LaLLLLCCty 


ggccagcgga 


t ccgccgggc 


cttcgtggcc 


T O A A 


gaggaggggt 


t~rf~t ^ f~r/~r /~i 4— /-r/-* 4— 

ggaggctggc 


ggttttggac 


tacagecaga 


ttgagctcag 


ggtcctggcg 


1860 


e~t /*■• *-i 4— 4— 4- fi s-*s-*r 

CaCCtLLCCy 


ggga.cgagaa 


cctaatccgg 


gtcttccagg 


a 999ccagga 


catccacacc 


1920 


CagaCyyCCa 


gctggatgtt 


cggcgtgccc 


ccagaggccg 


tggattccct 


gatgegcegg 


1980 


gcygccaaya 


/-1 /-1 *rv +- y^t r^i 4~~ ^ 


cggcgtcc tc 


tacggcatgt 


ccgcccaccg 


gettteggga 


2040 


y dy l LggcccL 


uccccuacga 


gg&99 c ggtg 


gccttcatcg 


ageggtattt 


ccagagctac 


2100 


l LLddy y L 


gggcccggac 


tgagaaaacc 


ctggcggaag 


9 a cgggaacg 


gggctatgtg 


2160 


na a /-1 r-i /-< 4— /— 1 -f— 
y aaaLL.L ul L. 


4— 4— /~rs~T f~1 /-I T /— 1 

L- uyyccgccg 


gcgctacgtg 


cccgacttgg 


cttcccgggt 


gaagagcatc 


2220 


cgggaggcag 


cggagcgcau 


ggccttcaac 


atgccggtcc 


aggggacege 


cgcggatttg 


2280 


atgaaactgg 


ccatggtgaa 


gctctttccc 


aggcttcagg 


agctgggggc 


caggatgett 


2340 


ttgcaggtgc 


acaacgaact 


ggtcctcgag 


gctcccaagg 


ageaagegga 


ggaagtcgee 


2400 


caggaggcca 


^gcggaccat 


ggaggaggtg 


tggcccctga 


a 99tgccctt 


ggaggtggaa 


2460 


gtgggcatcg 


gggaggactg 


gctttccgcc 


aaggectag 






2499 



<210> 251 
<211> 832 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 251 

Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 



441 



Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser lie Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Glu Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu Ala 
355 360 365 



442 



Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Asp Arg lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr lie Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg lie Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Gin Asp lie His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

443 



Ala Trp lie Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser lie Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 



<210> 252 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 252 

gccgccctcc tgaagcggct taaggg 2 6 



<210> 253 
<211> 26 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 253 

cccttaagcc gcttcaggag ggcggc 26 



<210> 254 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 254 

atcggcaaga cgcagaagac gggcaagc 2 8 



444 



<210> 255 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 255 

gcttgcccgt cttctgcgtc ttgccgat 2 8 

<210> 256 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 256 

ttgcaggtgc acaacgaact ggtcctc 27 

<210> 257 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Synthetic 
<400> 257 

gaggaccagt tcgttgtgca cctgcaa 27 



<210> 258 

<211> 2526 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 258 
atgaattcca 


ccccactttt 


tgacctggag 


gaacccccca 


agcgggtgct 


tctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttctat 


gccctgagcc 


tcaccacctc 


ccggggggag 


120 


ccggtgcaga 


tggtctacgg 


cttcgcccgg 


agcctcctca 


aggccttgaa 


ggaggacgga 


180 


caggcggtgg 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacgag 


240 


gcctacaagg 


cgggccgggc 


ccccaccccg 


gaggacttcc 


cccgccagct 


cgccttggtc 


300 


aagcggctgg 


tggaccttct 


gggcctggtc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


360 


gacgtcctgg 


gcaccctggc 


caagaaggcc 


gaaagggagg 


ggatggaggt 


gcgcatcctc 


420 


acgggagacc 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


480 



445 



yyyc 11 -^^ L -yy 


LLuLL LLdCtCL 


yy ciLy LLLdg 


fTa /**r a a it 4/" a /-*rN" 
yciy cidLj LdLy 


gggcgccccc 


ggagcgctgg 


n a r\ 


crtcraaci - 1~ pp 


CSCCJC i C i C i f' PaP 
yuyULL L LGLL 


CTCT(TrTfTa p /-tpfP 


4- (^if— f/-ra ^ a a f* a 
LLyydLddLd 


tccc cgy gg l 


ggcgggg a ta 


si r\ r\ 
b U U 


crcrcrcra. a rra 

yyy y^y ctcty a. 


LtyLLL L LLLJ 


dLLLLLCyCa 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


660 


ciciuu Ly y 


yyy '-waayLL 


y y etc u uycu c 


eggegcaaga 


tagaggegea 


cctcgaggac 


72 0 


L LLLdLLLL L 


r~x /-~\ 4~ 4~ a ira /—« *—< 4- 

LLL LaydLLU 


yyCCCyCat C 


cgcaccgacc 


tccccctgga 


ggtggacttt 


780 


?3£5crcTPPPt"crp 


crppcrpa rrrTa r* 
y LLy Lcty y dL 


LLLLy dLL Ly 


y a yyy cc ^9 a 


gggccttttt 


99 a 99 a 9ctg 


840 


y ciy ^y y ctct 


yLL LLL LLLcL 


l y ciy Lll y y l 


ccccugggag 


gggagaagc c 


cc 999 a 99 a 9 


9 00 


err* p p p p 1~ cr a p 

>W ^ l_- v^j ^-j ^ 


oLLLyLLLUCl 


cty yyyLLLLL 


r4~ fir* t<~rft 4~ 4~ /~i i^t 

y tgggcutcc 


tcct t tcccg 


caaggagccc 


n f r\ 
y D 0 


ct'-y >-yyy<-yy 


ay LLLLLyyL 


cccggcggcg 


gccccgggcg 


gccgcgtgca 


c eggge ag c a 


102 0 


(-ra /~i ^—i H — -1— /-V/— r 


egggge uaaa 


ggacctcaag 


gaggtceggg 


gcctcctcgc 


caaggacctc 


1080 




lc Lcyciggga 


9'9'99 c tagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


crppt" arrtrr 


"h crcra pppi"hp 

LyycLLLL L LL 


y cictLciLLdLL 


cccgaggggg 


tggcgcggcg 


etaegggggg 


12 0 0 


era (Ti~ cm a. fcrcr 

y «.y ^yydLyy 


o.y y ctLy LLy l 


LLdLLLjyy CC 


ct cct ct egg 


agaggctcca 


tcggaacctc 


1260 




LLy ctyyyyyd 


y y ciy cidy L LL 


/— i 4— 4~ 4~ rrrtn 4~ /~t 4— 

c l LEggc lcl 


accacgaggt 


ggaaaagece 


1320 


l l l LuL^yyy 


lll Lyyoccd 


LaLggaggcc 


aceggggtae 


9*9 c 9'9y" a cgt 


ggcctacctt 


1380 


p a rrn r~« / — ■ ^ — • i — i — i — 

V — ■ CLy yuLULLL 


l l l l y y cty l l 


E 9 c yy a y9 a 9 


at ccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


L- L-yyLgygcc 


dLLCC l ccaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


y cty l l l cty y l 


4- 4- p/*«pn'pr' , l"f" 
LLLLLyLLLL 


gctdyddgacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


Cfpcrcrt~erp1~eref 

y ( -yy L y t ' L yy 


cty y lll LctLy 


yydgy cccac 


cccaucgLgg 


agaagatcct 


ccagcaccgg 


1620 


n a err -1 "!"" pa ppa 

Ci^-H V^- I V. d. V U CI 


cty L L Lctciy da 


LdLLLdCyty 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


a f-*crcrcrp pctp p 


L LLctLclL LLy 


L Ltcaaccag 


acggccacgg 


ecaeggggag 


gcttagtagc 


1740 




dLLLgcagaa 


catccccgtc 


cgcaccccct 


tgggecagag 


gatccgccgg 


1800 


yLLULCy cyy 


ccgaggcggg 


tt 9*gy* c 9 tt g 


gtggccctgg 


actatageca 


gatagagctc 


1860 


LyL-y iLttcy 


CCCaCCCCUC 


eggggacgaa 


aacctgatca 


gggtcttcca 


gg a gggg aa g 


1920 


^CLLCtLLLdLd 


LLLCtyctLLyL 


ddgc Lyyaug 


t tcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


nfrts t" ctpctp r** 
v_* uyciLyLy ll 


yyycyyccaa 


gaeggtgaac 


ttcggcgtcc 


tetaeggcat 


gtccgcccat 


2040 


a CTCTP t" pt~ prr 


a rrpfa pfp 4— 4— 

ciyycty it lljc 


/'—i -^i 4— /-i >-i >-t 4— « 

caLccccEac 


g a ggaggcgg 


tggectttat 


agagegctae 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


9999 a gg aa 9 


2160 


eggggctacg 


tggaaaccct 


etteggaaga 


aggegctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


a tggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 
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gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 240 0 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 

<210> 259 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 259 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 
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Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 
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Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 



Ser Ala Lys Gly His His His His His His 
835 840 



<210> 260 
<211> 2514 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 260 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 
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c cyyecCdCC 


gcaccccc c c 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


ggagccggtc 


12 0 


^^y y y *-y 


aeggg cccgc 


caagagcett 


ttgaaggege 


t aagggaaga 


cggggatgtg 


180 


gcgaccgcgg 


cgc ccgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


" y y ^* y y y y ^ 


yyy cccccac 


ccccgaggac 


ttt ccccggc 


agcttgccct 


tatcaaggag 


3 0 0 


ci.uyycyga.cc 


c c c egggee c 


ggagcgcctc 


gaggtgeegg 


gctttgaagc 


ggatgacgtc 


360 




cy y ceddgaa 


yy c yy aaaa g 


gaaggccacg 


aagtgcgcat 


cctcaccgcg 


42 0 


ga.ccggga.cc 


tttaccagct 


tettteggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 


CtyaUCaCCC 


cggagtggct 


1 1 gggagaag 


tacgggctta 


agccttccca 


gtgggtggac 


540 


cdeey y y cc c 


eggcegggga 


ccctt ccgac 


aacatccccg 


gcgtgaaggg 


categgggag 


600 




ppa a rrf 4- — > 4- 

ceddy c cy dc 


cc ggg a g^gg 


gg aa gcctgg 


aaaaccttct 


taagcacctg 


66 0 


y act L*cty y L.y ct 


ddcccycccc 


cgegegggag 


aagatcc tta 


gccacatgga 


ggacctcaag 


720 


CtaUCCCtyy 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 


cgccgggagc 


eggaceggga 


ggggcttaag 


gectttttgg 


agaggctgga 


gtteggaage 


840 


c tec cccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


eggaggaage 


tccctggccg 


900 


ccccccgagg 


gagee t t cgt 


gggg ta cgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


c c Lddcyccc 


cggccgccgc 


ctggggcggc 


cgcgtgcacc 


gggcagcaga 


ccccttggcg 


1020 


gggctaaagg 


acctcaagga 


ggtcegggge 


ctcctcgcca 


aggacctcgc 


cgtcttggcc 


1080 


tcgagggagg 


ggccagacc c 


cgtgcccggg 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 


gaccc cccga 


acaccacccc 


cgagggggtg 


gcgcggcgct 


aeggggggga 


gtggacggag 


1200 


y dey ccy ccc 


dccgggccc c 


cct ct eggag 


a ggctccatc 


ggaacctcct 


taagcgcctc 


1260 


yd.yyyyyd.yg 


dy ddy c c c c c 


4- 4~ /~i 4- «4- -» ri 

cegge cccac 


cacgaggtgg 


aaaagcccct 


ctcccgggtc 


1320 


cyycccdcd 


^ggaggecac 


c gggg tac gg 


c 999 a cgtgg 


cctaccttca 


ggccctttcc 


1380 


ceggage c eg 


eggaggagat 


ccgccgcctc 


gaggaggagg 


tcttccgctt 


gg c ggg cc ac 


1440 


z"" 1 f* f 1 1~ "f - (■"■* a --j /— i 
CCC C C CddCC 


ccaacccccg 


ggaccagctg 


gaaagggtgc 


tctttgacga 


gcttaggctt 


1500 


cccgcc c cga 


agaagacgaa 


gaagacaggc 


aagcgctcca 


ccagcgccgc 


ggtgctggag 


1560 


y ccccacygg 


aggc c c acc c 


catcgtggag 


aagatcctcc 


ageaceggga 


gctcaccaag 


1620 


cccaagaaca 


cctacgtgga 


ccccctccca 


agcctcgtcc 


acccgaggac 


gggccgcctc 


1680 


cacacccgct 


tcaaccagac 


ggccacggcc 


a ^ggggaggc 


ttagtagctc 


cgaccccaac 


1740 


ctgeagaaca 


tccccgtccg 


cacccccttg 


ggccagagga 


tccgccgggc 


cttcgtggcc 


1800 


gaggegggtt 


gggcgttggt 


ggccctggac 


tatagecaga 


tagagctccg 


cgtcctcgcc 


1860 


cacctctccg 


gggacgaaaa 


cctgatcagg 


gtcttccagg 


aggggaagga 


catccacacc 


1920 
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yctyyatyut 


/-i <~i 4-* 

cggcgucccc 


ccggaggccg 


tggaccccct 


gatgcgccgg 


1980 


cic*dci r* a ana 

y y y ctcty a. 


yy uyaaOLL 


cy gt-y uccuc 


tacggcaugt 


ccgcccatag 


gctctcccag 


2 040 


y cty v_ i_ l. y ^- t_ a 




gg a ggcggt:g 


gccttt atag 


agcgctactt 


ccaaagcttc 


2100 


i^oi—dcLy y uy l 


yygcctgyau 


rn /**r *?i ^ "Tv ^ ^ /— i 

dgadadyaC C 


ctggaggagg 


9"9* a 9"9* aa g c 9 


gggctacgtg 


2160 


na aarrci" 


t~ ffTfra a rra a pt 
y y ctdy ctcty 


gcgctacgtg 


cccgacctca 


acgcccgggt 


gaagagcgtc 




j j j yy 


<-y y ciy cy (_dL. 


ggccttcaac 


atgcccgtcc 


agggc ac cgc 


cgccgacctc 


22 80 


atgaagctcg 


ccatggtgaa 


gctcttcccc 


cgcctccggg 


agatgggggc 


ccgcatgctc 


2340 


ctccaggtcg 


ccaacgagct 


cctcctggag gccccccaag 


cgcgggccga 


ggaggtggcg 


2400 


gctttggcca 


aggaggccat 


ggagaaggcc 


tatcccctcg 


ccgtgcccct 


ggaggtggag 


2460 


gtggggatgg 


gggaggactg 


gctttccgcc 


aagggtcacc 


accaccacca 


ccac 


2514 



<210> 261 
<211> 838 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 261 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 
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Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys lie Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala 
325 330 335 

Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 

Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 505 510 
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Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 

Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 

Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

His His His His His His 
835 
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<210> 262 

<211> 2526 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 262 
atgaattcca 


ccccactttt 


tgacctggag 


gaacccccca 


agcgggtgct 


tctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttctat 


gccctgagcc 


tcaccacctc 


ccggggggag 


120 


ccggtgcaga 


tggtctacgg 


cttcgcccgg 


agcctcctca 


aggccttgaa 


ggaggacgga 


180 


caggcggtgg 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacgag 


240 


gcctacaagg 


c 999 cc 999 c 


ccccaccccg 


gaggacttcc 


cccgccagct 


cgccttggtc 


300 


aagcggctgg 


tggaccttct 


gggcctggtc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


360 


gacgtcctgg 


gcaccctggc 


caagaaggcc 


gaaagggagg 


ggatggaggt 


gcgcatcctc 


420 


acgggagacc 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


480 


gggaccctgg 


tcaccccaaa 


ggacgtccag 


gagaagtacg 


gggtgccccc 


ggagcgctgg 


540 


gtggacttcc 


gcgccctcac 


gggggaccgc 


tcggacaaca 


tccccggggt 


ggcggggata 


600 


ggggagaaga 


ccgcccttcg 


actcctcgca 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


660 


aacctggacc 


gggtaaagcc 


ggactcgctc 


cggcgcaaga 


tagaggcgca 


cctcgaggac 


720 


ctccacctct 


ccttagacct 


ggcccgcatc 


cgcaccgacc 


tccccctgga 


ggtggacttt 


780 


aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt 


ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtcca 


ccgggccccc 


1020 


gagccttata 


aagccctcag 


ggacctgaag 


gaggcgcggg 


ggcttctcgc 


caaagacctg 


1080 


agcgttctgg 


ccctgaggga 


aggccttggc 


ctcccgcccg 


gcgacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcccggcg 


ctacggcggg 


1200 


gagtggacgg 


aggaggcggg 


ggagcgggcc 


gccctttccg 


agaggctctt 


cgccaacctg 


1260 


cttaagaggc 


ttgaggggga 


ggagaggctc 


ctttggcttt 


accgggaggt 


ggagaggccc 


1320 


ctttccgctg 


tcctggccca 


tatggaggcc 


ac 99999tgc 


gcctggacgt 


ggcctatctc 


1380 


agggccttgt 


ccctggaggt 


ggccgaggag 


atcgcccgcc 


tcgaggccga 


ggtcttccgc 


1440 


ctggccggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


cctctttgac 


1500 


gagctagggc 


ttcccgccat 


caagaagacg 


caaaagaccg 


gcaagcgctc 


caccagcgcc 


1560 
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(T/-1 f~>rr\- ppf- ptpt 

ytty tc^ Ly y 


a ptptp 1 P 1 r*\~ r~* pt 

dyy ccc Lccy 


/-i p* — i ptpt p* c* a p> 

cydyy cccdc 


pipipia - ! - P 1 PT"l — pfpf 

cccdLcycyy 


dyddyaLCCL 


ycay Ldccgy 


1 (TOO 


pts PTP^i - p* a ppa 
y cty v_ l l ct l l ct 


anplrfa o na pt 
cty l. Lyddydy 


CdCCLdCdL L 


fia pi pi >—i pi -l— -l— /-rp 
y dece c l Ly c 


cyy dec ucdL 


ccdccccdyy 


J.OOU 


aLy y y l n- 


I - pi p 1 a p* a t~> r^rrv 

LLLaLaLULy 


C L L CddCCdy 


"3 pi PTPT ppa P PTPT 

dcy y c edey y 


ccdcyyyc dy 


y ceddy Ldy c 






aprlrpana a 
aLL LLUayaa 


pat" p* P 1 * p 1 ptprf- pi 
LctLLLLLy LL 


piPTpi a (*~< p»p^ p'ptp -1 
LyLaLLLLyL 


l Lyyy cdydy 


ydLccyccyy 




»~f /—i /— i 4- 4- /— t --\ 4— 

ycct.uca.ccy 


J-lf-*^ j~c/-V^ *">CP*T/*T 

c cgaggaggy 


/~t 4~~ /*f/*r/*i 4- 4— 4— y^r 

y Lyy CLaLLy 


gtggccctgg 


actatageca 


gatiagagctc 


1 Q A 


agggi-gctgg 


LLLaLLUCLC 


cyy eg acyay 


aaCCLydLCC 


gggtct t cca 


ggs-ggggegg 


T q o n 




cyy ayaccy c 


cay c cyyaLy 


LLCyyCyLCC 


cccgggaggc 


cgtggacccc 


lydu 


l Ly ciuy cy ll 


yyy c yy ccaa 


ydCCdLCaaC 


LLCyyyy lcc 


LCLacggcac 


y LCyyCCCdC 


n a a n 
Z U 4t U 


LyLL LLLLLL 


dyy dye l dye 


LdLCCCLLaC 


y ay y ayy c c c 


^ /^r /~r /~i /-i 4— 4— +-* 

aygee LLcac 


LgagcgcLac 


oi nn 


L L L C ciy dy C L 


uccccddyy l- 


gegggecugg 


aLcgayaay a 


cccl ggagga 


gggcaggagg 


ni en 
z io U 


cgggggtacg 


cyyay accc l 


ctt cggccgc 


cgccgctacg 


tgccagacct 


agaggecegg 


r> i o a 


prt" naarra pt p-'pt 
y Lyddy dy ey 


■H ptpiptpt/t r*f~tr+ 

^gegggagge 


ggccgagcgc 


dLyyCCL Lea 


acatgcccgt 


c c agggc ac c 


o o q n 
z z o U 


y LLy LLy dec 


LLd.Lyad.yLL 


yycuauyy eg 


aay c lc Luce 


ccaggctgga 


ggaaatgggg 


zo4U 


gecaggatge 


tccttcaggt 


cgccaacgag 


ctggtcctcg 


aggccccaaa 


agagagggcg 


2400 


gaggccgtgg 


cccggctggc 


caaggaggtc 


atggaggggg 


tgtatcccct 


ggccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


aggggaggac 


tggctctccg 


ccaaggagca 


ccaccaccac 


2520 


caccac 
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<210> 263 
<211> 842 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 263 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 
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Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 
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Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala lie Lys Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp 
595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 
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Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 

<210> 264 
<211> 2514 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 264 
atgaat t ccc 


tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga 


cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


ggagccggtc 


120 


caggcggtgt 


acgggtttgc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 


gtgatcgtgg 


tgtttgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


atggtggacc 


ttttgggcct 


ggagcgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat 


cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 


ctgatcaccc 


cggagtggct 


ttgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


600 


aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctggggcggc 


cgcgtccacc 


gggcccccga 


gccttataaa 


1020 


gccctcaggg 


acctgaagga 


ggcgcggggg 


cttctcgcca 


aagacctgag 


cgttctggcc 


1080 


ctgagggaag 


gccttggcct 


cccgcccggc 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 
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gacccttcga acaccacccc cgagggggtg gcccggcgct acggcgggga gtggacggag 12 00 
gaggcggggg agcgggccgc cctttccgag aggctcttcg ccaacctgct taagaggctt 126 0 
gagggggagg agaggctcct ttggctttac cgggaggtgg agaggcccct ttccgctgtc 132 0 
ctggcccata tggaggccac gggggtgcgc ctggacgtgg cctatctcag ggccttgtcc 138 0 
ctggaggtgg ccgaggagat cgcccgcctc gaggccgagg tcttccgcct ggccggccac 1440 
cccttcaacc tcaactcccg ggaccagctg gaaagggtcc tctttgacga gctagggctt 1500 
cccgccatca agaagacgca aaagaccggc aagcgctcca ccagcgccgc cgtcctggag 1560 
gccctccgcg aggcccaccc catcgtggag aagatcctgc agtaccggga gctcaccaag 1620 
ctgaagagca cctacattga ccccttgccg gacctcatcc accccaggac gggccgcctc 1680 
cacacccgct tcaaccagac ggccacggcc acgggcaggc taagtagctc cgatcccaac 1740 
ctccagaaca tccccgtccg caccccgctt gggcagagga tccgccgggc cttcatcgcc 1800 
gaggaggggt ggctattggt ggccctggac tatagccaga tagagctcag ggtgctggcc 1860 
cacctctccg gcgacgagaa cctgatccgg gtcttccagg aggggcggga catccacacg 1920 
gagaccgcca gctggatgtt cggcgtcccc cgggaggccg tggaccccct gatgcgccgg 1980 
gcggccaaga ccatcaactt cggggtcctc tacggcatgt cggcccaccg cctctcccag 2 04 0 
gagctagcca tcccttacga ggaggcccag gccttcattg agcgctactt tcagagcttc 2100 
cccaaggtgc gggcctggat tgagaagacc ctggaggagg gcaggaggcg ggggtacgtg 2160 
gagaccctct tcggccgccg ccgctacgtg ccagacctag aggcccgggt gaagagcgtg 222 0 
cgggaggcgg ccgagcgcat ggccttcaac atgcccgtcc agggcaccgc cgccgacctc 2280 
atgaagctgg ctatggtgaa gctcttcccc aggctggagg aaatgggggc caggatgctc 2340 
cttcaggtcg ccaacgagct ggtcctcgag gccccaaaag agagggcgga ggccgtggcc 2400 
cggctggcca aggaggtcat ggagggggtg tatcccctgg ccgtgcccct ggaggtggag 24 60 
gtggggatag gggaggactg gctctccgcc aaggagcacc accaccacca ccac 2514 

<210> 265 
<211> 838 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 265 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 
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Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ser lie Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys lie Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Pro 
325 330 335 

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro 
355 360 365 
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Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu 
420 425 430 

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala lie Lys Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 
530 535 540 

Tyr lie Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 
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Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala 
785 790 795 800 

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

His His His His His His 
835 



<210> 266 

<211> 2505 

<212> DNA 

<213> Thermus thermophilus 

<400> 266 



atggaggcga 


tgcttccgct 


ctttgaaccc 


aaaggccggg 


tcctcctggt 


ggacggccac 


60 


cacctggcct 


accgcacctt 


cttcgccctg 


aagggcctca 


ccacgagccg 


gggcgaaccg 


120 


gtgcaggcgg 


tctacggctt 


cgccaagagc 


ctcctcaagg 


ccctgaagga 


ggacgggtac 


180 


aaggccgtct 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacgag 


240 


gcctacaagg 


cggggagggc 


cccgaccccc 


gaggacttcc 


cccggcagct 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggtttacc 


cgcctcgagg 


tccccggcta 


cgaggcggac 


360 


gacgttctcg 


ccaccctggc 


caagaaggcg 


gaaaaggagg 


ggtacgaggt 


gcgcatcctc 


420 


accgccgacc 


gcgacctcta 


ccaactcgtc 


tccgaccgcg 


tcgccgtcct 


ccaccccgag 


480 


ggccacctca 


tcaccccgga 


gtggctttgg 


gagaagtacg 


gcctcaggcc 


ggagcagtgg 


540 


gtggacttcc 


gcgccctcgt 


gggggacccc 


tccgacaacc 


tccccggggt 


caagggcatc 


600 


ggggagaaga 


ccgccctcaa 


gctcctcaag 


gagtggggaa 


gcctggaaaa 


cctcctcaag 


660 


aacctggacc 


gggtaaagcc 


agaaaacgtc 


cgggagaaga 


tcaaggccca 


cctggaagac 


720 


ctcaggctct 


ccttggagct 


ctcccgggtg 


cgcaccgacc 


tccccctgga 


ggtggacctc 


780 


gcccaggggc 


gggagcccga 


ccgggagggg 


cttagggcct 


tcctggagag 


gctggagttc 


840 
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ggcagcctcc tccacgagtt cggcctcctg 
tggcccccgc cggaaggggc cttcgtgggc 
gcggagctta aagccctggc cgcctgcagg 
ttggcggggc taaaggacct caaggaggtc 
ttggcctcga gggaggggct agacctcgtg 
ctcctggacc cctccaacac cacccccgag 
acggaggacg ccgcccaccg ggccctcctc 
cgcctcgagg gggaggagaa gctcctttgg 
cgggtcctgg cccacatgga ggccaccggg 
ctttccctgg agcttgcgga ggagatccgc 
ggccacccct tcaacctcaa ctcccgggac 
aggcttcccg ccttggggaa gacgcaaaag 
ctggaggccc tacgggaggc ccaccccatc 
accaagctca agaacaccta cgtggacccc 
cgcctccaca cccgcttcaa ccagacggcc 
cccaacctgc agaacatccc cgtccgcacc 
gtggccgagg cgggttgggc gttggtggcc 
ctcgcccacc tctccgggga cgaaaacctg 
cacacccaga ccgcaagctg gatgttcggc 
cgccgggcgg ccaagacggt gaacttcggc 
tcccaggagc ttgccatccc ctacgaggag 
agcttcccca aggtgcgggc ctggatagaa 
tacgtggaaa ccctcttcgg aagaaggcgc 
agcgtcaggg aggccgcgga gcgcatggcc 
gacctcatga agctcgccat ggtgaagctc 
atgctcctcc aggtccacga cgagctcctc 
gtggcggctt tggccaagga ggccatggag 
gtggaggtgg ggatggggga ggactggctt 

<210> 267 
<211> 834 
<212> PRT 

<213> Thermus aquaticus 
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<400> 267 

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg 
130 135 140 

Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
145 150 155 160 

Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu 
195 200 205 

Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu Glu Asp 
225 230 235 240 

Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val His Arg 
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325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 
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Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly 



<210> 268 
<211> 832 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 268 

Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
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90 



95 



Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Glu Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu Ala 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 



Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 
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Leu Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Asp Arg lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr lie Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg lie Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Gin Asp lie His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp lie Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser lie Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 
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Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 



Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 



<210> 269 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 269 

gcgcggcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag 60 
cgc 63 



<210> 270 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 270 

gcgctagggc gctggcaagt gtagcggtca 3 0 



<210> 271 

<211> 35 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 271 

gatcgctgcg cgtaaccacc acacccgcgc cgcgc 35 



<210> 272 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 272 

ggactctgcc tcaagacggt agtcaacgtg 30 



<210> 273 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 273 

cacgttgact accgtc 16 



<210> 274 
<211> 34 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 274 

catgtcaagc agtcctaact ttgaggcaga gtcc 34 



<210> 275 
<211> 36 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 275 

cagaccatga attccacccc actttttgac ctggag 3 6 



<210> 276 

<211> 36 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 276 

gtggacgcgg ccgcccgagg ccgccgccag ggccag 3 6 



<210> 277 

<211> 36 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
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<400> 277 

cagaccatga attccctgcc cctctttgag cccaag 



36 



<210> 278 

<211> 36 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 278 

gtaaaccgcg ccgccccagg cggcggccaa ggcgtt 36 
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CLAIMS 



We claim: 



5 1. A composition comprising an enzyme, wherein said enzyme comprises a 

heterologous functional domain, wherein said heterologous functional domain provides altered 
functionality in a nucleic acid cleavage assay. 



10 



,15 



2. The composition of Claim 1, wherein said enzyme comprises a 5' nuclease. 

3. The composition of Claim 2, wherein said 5' nuclease comprises a thermostable 
5' nuclease. 

4. The composition of Claim 1, wherein said enzyme comprises a polymerase. 

5. The composition of Claim 4, wherein said polymerase is altered in sequence 
relative to a naturally occurring sequence of a polymerase such that it exhibits reduced DNA 
synthetic activity from that of the naturally occurring polymerase. 

6. The composition of Claim 4, wherein said polymerase comprises a thermostable 
polymerase. 

7. The composition of Claim 6, wherein said thermostable polymerase comprises a 
polymerase from a Thermus species. 

8. The composition of Claim 7, wherein said Thermus species is selected from 
Thermus aquaticus, Thermus flavus, Thermus thermopMus, Thermus filiformus, and Thermus 
scotoductus. 



30 9 - The composition of Claim 1, wherein said heterologous functional domain 
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25 



comprises an amino acid sequence that provides an improved nuclease activity in said nucleic 
acid cleavage assay. 

10. The composition of Claim 1, wherein said heterologous functional domain 

5 comprises an amino acid sequence that provides an improved substrate binding activity in said 
nucleic acid cleavage assay. 

11. The composition of Claim 1, wherein said heterologous functional domain 
comprises an amino acid sequence that provides improved background specificity in said 

10 nucleic acid cleavage assay. 

12. The composition of Claim 1, wherein said heterologous functional domain 
^ comprises two or more amino acids from a polymerase domain of a polymerase. 

13. The composition of Claim 12, wherein at least one of said two or more amino 
f i acids is from a palm region of said polymerase domain. 



Q 14. The composition of Claim 12, wherein at least one of said two or more amino 

acids is from a thumb region of said polymerase domain. 

t° 

Q 15. The composition of Claim 12, wherein said polymerase comprises Thermus 

thermophilics polymerase. 

16. The composition of Claim 12, wherein said two or amino acids from said 

25 polymerase domain comprise two or more amino acids from amino acids 300-650 of SEQ ID 
NO:267. 

17. The composition of Claim 1, wherein said enzyme comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOs:75, 77, 79, 81, 83, 85, 87, 89, 

30 91, 93, 95, 97, 99, 101, 103, 106, 109, 112, 115, 118, 121, 124, 127, 130, 133, 136, 139, 
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142, 145, 148, 150, 153, 157, 160, 163, 166, 169, 172, 175, 178, 181, 184, 187, 190, 200, 
202, 204, 206, 212, 214, 216, 218, 221, 226, 228, 230, 232, 234, 236, 239, 241, 243, 251, 
259, 261, 263, and 265. 

5 18. The composition of Claim 1, wherein said nucleic acid cleavage assay 

comprises cleavage of a DNA member of a substrate containing at least one RNA component. 

19. The composition of Claim 1, wherein said nucleic acid cleavage assay 
comprises an invasive cleavage assay. 

10 

20. A composition comprising a nucleic acid encoding the enzyme of Claim 1. 

yg 21. The composition of Claim 20, wherein said nucleic acid is selected from the 

}f] group consisting of SEQ ID NOs:74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 

;15 105, 108, 111, 114, 117, 120, 123, 126, 129, 132, 135, 138, 141, 144, 147, 149, 152, 156, 

O 159, 162, 165, 168, 171, 174, 177, 180, 183, 186, 189, 199, 201, 203, 205, 211, 213, 215, 

* 217, 220, 225, 227, 229, 231, 233, 235, 238, 240, 242, 250, 258, 260, 262, and 264. 

fjji 22. The composition of Claim 20, further comprising an expression vector operably 

5p linked to said nucleic acid. 

23. A composition comprising a host cell containing the composition of Claim 22. 

24. A method for producing an altered enzyme with improved functionality in a 
25 nucleic acid cleavage assay comprising: 

a) providing an enzyme and a nucleic acid test substrate; 

b) introducing a heterologous functional domain into said enzyme to 
produce an altered enzyme; 

c) contacting said altered enzyme with said nucleic acid test substrate to 
30 produce cleavage products; and 
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d) detecting said cleavage products. 



25. The method of Claim 24, wherein said introducing a heterologous functional 
domain comprises mutating one or more amino acids of said enzyme. 

26. The method of Claim 24, wherein said introducing a heterologous functional 
domain into said enzyme comprises adding a functional domain from a protein into said 
enzyme. 

27. The method Claim 26, wherein said adding a functional domain from a protein 
into said enzyme comprising removing a portion of said enzyme sequence prior to adding said 
functional domain of said protein. 

28. The method of Claim 24, wherein said nucleic acid test substrate comprises a 
cleavage structure. 

29. The method of Claim 28, wherein said cleavage structure comprises an RNA 
target nucleic acid. 

30. The method of Claim 28, wherein said cleavage structure comprises an invasive 
cleavage structure. 

31. The method of Claim 24, wherein said enzyme comprises a 5' nuclease. 

32. The method of Claim 31, wherein said 5' nuclease comprises a thermostable 5' 
nuclease. 

33. The method of Claim 24, wherein said enzyme comprises a polymerase. 

34. The method of Claim 33, wherein said polymerase is altered in sequence 
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relative to a naturally occurring sequence of a polymerase such that it exhibits reduced DNA 
synthetic activity from that of the naturally occurring polymerase. 

35. The method of Claim 34, wherein said polymerase comprises a thermostable 
polymerase. 

36. The method of Claim 35, wherein said thermostable polymerase comprises a 
polymerase from a Thermus species. 

37. The method of Claim 36, wherein said Thermus species is selected from 
Thermus aquaticus, Thermus flavus, Thermus thermophilus, Thermus filiformus, and Thermus 
scotoductus. 

38. The method of Claim 24, wherein said heterologous functional domain 
comprises an amino acid sequence that provides an improved nuclease activity in said nucleic 
acid cleavage assay. 

39. The method of Claim 24, wherein said heterologous functional domain 
comprises an amino acid sequence that provides an improved substrate binding activity in said 
nucleic acid cleavage assay. 

40. The method of Claim 24, wherein said heterologous functional domain 
comprises an amino acid sequence that provides improved background specificity in said 
nucleic acid cleavage assay. 

41. The method of Claim 24, wherein said heterologous functional domain 
comprises two or more amino acids from a polymerase domain of a polymerase. 

42. The method of Claim 41, wherein at least one of said two or more amino acids 
is from a palm region of said polymerase domain. 
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43. The method of Claim 41, wherein at least one of said two or more amino acids 
is from a thumb region of said polymerase domain. 

44. The method of Claim 41, wherein said polymerase comprises Thermus 
5 thermophilus polymerase. 

45. The method of Claim 41, wherein said two or amino acids from said 
polymerase domain comprise two or more amino acids from amino acids 300-650 of SEQ ID 
NO:267. 

10 

46. The method of Claim 24, wherein said enzyme comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOs:75, 77, 79, 81, 83, 85, 87, 89, 

J 91, 93, 95, 97, 99, 101, 103, 106, 109, 112, 115, 118, 121, 124, 127, 130, 133, 136, 139, 
J 142, 145, 148, 150, 153, 157, 160, 163, 166, 169, 172, 175, 178, 181, 184, 187, 190, 200, 
% i5 202, 204, 206, 212, 214, 216, 218, 221, 226, 228, 230, 232, 234, 236, 239, 241, 243, 251, 
O 259, 261, 263, and 265. 

""It™ 

O 47. The method of Claim 24, wherein said altered enzyme comprises an amino acid 

flj sequence selected from the group consisting of SEQ ID NOs:75, 77, 79, 81, 83, 85, 87, 89, 
!§0 91, 93, 95, 97, 99, 101, 103, 106, 109, 112, 115, 118, 121, 124, 127, 130, 133, 136, 139, 
O 142, 145, 148, 150, 153, 157, 160, 163, 166, 169, 172, 175, 178, 181, 184, 187, 190, 200, 

202, 204, 206, 212, 214, 216, 218, 221, 226, 228, 230, 232, 234, 236, 239, 241, 243, 251, 

259, 261, 263, and 265. 

25 48. An altered enzyme produced by the method of Claim 24. 

49. A kit comprising the altered enzyme of Claim 48. 

50. A kit comprising the composition of Claim 1. 

30 
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51. The kit of Claim 50, further comprising at least one nucleic acid cleavage 
substrate. 

52. The kit of Claim 51, further comprising at least one RNA capable of 
hybridizing to said at nucleic acid cleavage substrate. 

53. The kit of Claim 50, further comprising a labeled oligonucleotide. 

54. The kit of Claim 50, further comprising an invasive oligonucleotide. 

55. A method for cleaving a nucleic acid comprising: 

a) providing: 

i) the enzyme of Claim 1; and 

ii) a substrate nucleic acid; and 

b) exposing said substrate nucleic acid to said enzyme. 

56. The method of Claim 55, wherein said exposing said substrate nucleic acid to 
said enzyme produces at least one cleavage product. 

57. The method of Claim 56, further comprising the step of c) detecting said 
cleavage product. 
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ABSTRACT 



The present invention relates to novel enzymes designed for direct detection, 
characterization and quantitation of nucleic acids, particularly RNA. The present invention 
provides enzymes that recognize specific nucleic acid cleavage structures formed on a target 
RNA sequence and that cleave the nucleic acid cleavage structure in a site-specific manner to 
produce non-target cleavage products. The present invention provides enzymes having an 
improved ability to specifically cleave a DNA member of a complex comprising DNA and 
RNA nucleic acid strands. 
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3 ' NH4 -AATTGCTCCGCGTGGTTGACGAAGGAGGC- 5 

^CAACTGCTTCCTCCG- 3 



t 



(SEQ ID NO:30) 
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(SEQ ID NO:33) (SEQ ID NO:32) 

-ACGAGCGTCTTT CGCTGTCTCGCT 
TTGCTCGCAGAAA GCGAC AGAGCGA- F 1 - 

t 

(SEQ ID NO:34) 
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^S* (SEQ ID NO:26) 



_ C AACTGCCGTGA 

CGCACG 

G GCGUGGUUGACGGCACU-5 ' 
t 

(SEQ ID NO:35) 



(SEQ ID NO:26) 

CAACTGCCGTGA 
A CGCACG 
A G GCGTGGTTGACGGCACT-5' 

t 

(SEQ ID NO:36) 
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